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BETA-DEFENSINS 

5 

BACKGROUND OF THE INVENTION 

Biological defense strategies have evolved to 
protect organisms from invasion by other species. 

10 Microbial infection response systems include oxidative and 
non- oxidative mechanisms, utilizing compounds that are 
enzymatically synthesized in cells and peptides that are 
single gene products. 

Anti-microbial peptides constitute an oxygen- 

15 independent host defense system found in organisms 
encompassing many taxonomic families. One major class of 
anti-microbial peptides can be sequence -de fined by 
conserved cysteine residue patterns and are termed 
defensins. Mammalian defensins, derived from skin, lung 

20 and intestine, exhibit antibiotic activity against a wide 
variety of pathogens, including gram-positive and gram- 
negative bacteria, fungi ( e.g. . Candida species) and 
viruses. See, for example, Porter et al . , Infect . Immun . 
65 (6) : 2396-401, 1997. 

25 The amphipathic character of the defensin 

peptides appears to be the key to the general mechanism of 
microbial attack, i.e. . by creating pores, or "boring" 
through the cell wall. In addition, Daher et al . , i. 
Virol . 60 (3) : 1068-74, 1986, reported that enveloped 

30 viruses, including herpes simplex types 1 and 2, 
cytomegalovirus and influenza virus (A/WSN) , among others, 
were inactivated by incubation with human neutrophil 
peptide (HNP-1) and speculated that the binding of 
defensin molecules to viruses impairs the virus' ability 

35 to infect cells. 

The defensin family of anti-microbial peptides 
can be divided into two major subclasses based on two 
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distinct consensus sequences. See, for example, Martin et 
al., Journal of Leukocyte Biology 58 : 128-36, 1995. The 
first defensin subclass, classic defensins, represented by 
HNPs are stored in the so-called large azurophil granules 
5 of neutrophils and macrophages and attack microorganisms 
that have been phagocytosed by these cells. The amino 
acid sequence of HNPs is consistent with a predicted 
disulfide bridging that is distinct from that of the p~ 
defensin subclass. Epithelial cells can also be a source 

10 of defensins, and these cells appear to secrete these 
peptides into the external, extra-cellular environment. 
In the mouse, for example, Paneth cells of the small 
intestine and proximal colon, secrete def ensin-like 
peptides, called crypt idins, into the lumen. See, for 

15 example, Ouellette and Selsted, The FASEB Journal 10.: 
1280-9, 1996. 

P-defensins, the second major defensin subclass, 
include peptides found in bovine lung ( e.g. . BNBD-bovine 
neutrophil P-defensins) as well as a secreted form (TAP - 

20 tracheal anti -microbial peptide) . See, for example, 
Selsted et al., J. Biol. Chem. 268 (9) ; 6641-8, 1993. Two 
human p-defensins have been reported. SAP-1 was isolated 
from human psoriatic skin, and hBD-1 was found in low 
concentrations in human blood filtrate. See, for example, 

25 Bensch et al., FEBS Lett. 368 (2) : 331-5, 1995). The amino 
acid sequence of these human P-defensins is most similar 
to the bovine BNDPs and TAP. See, for example, Harder et 
al., Nature 387 ; 861, 1997, wherein SAP-1 is designated 
hBD-2. 

30 Other than the conserved cysteine residues the 

defensin family is quite sequence divergent. It is 
possible that the variant amino acid positions may be 
related to the site or conditions of activity or to the 
spectrum of pathogens attacked by a particular defensin. 

35 In addition to anti-microbial activities, 

particular defensins exhibit metabolically sensitive 
cytotoxic activity (Lichtenstein et al . , Blood 68 : 1407- 
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10, 1986 and Sheu et al . , Antimicrob. Agents Chemother. 
28: 626-9, 1993) , alter the response of adrenal cortical 
cells to ACTH (Zhu et al., Proc. Natl. Acad. Sci . (USA) 
85 (2) ; 592-6, 1988) and have specific chemotactic activity 
5 for human monocytes (Territo et al . , J. Clin. Invest. 
84 (6) : 2017-20, 1989) . Recruitment of monocytes by 
neutrophils may, in part, be mediated by neutrophilic 
defensins and suggests a pro- inflammatory activity for 
these peptides in addition to their ant i -microbial 

10 effects. Also, a decrease in defensin mRNA level has been 
demonstrated in SPG (specific granule disease) . See, for 
example, Tamura et al . , Japan. Int. J. Hematol . 59(2) : 
137-42, 1994. Higazi et al . , J. Biol. Chem. 271(3) : 17650- 
5, 1996, suggested that plasminogen bound to fibrin in the 

15 presence of defensin may be less susceptible to activation 
by tPA. 

Moieties having anti-microbial, 

immunostimulatory, pro- inflammatory and other properties 
of defensins are sought. The present invention provides 
20 such polypeptides for these and other uses that should be 
apparent to those skilled in the art from the teachings 
herein. 

SUMMARY OF THE INVENTION 

25 Within one aspect the invention provides an 

isolated protein comprising a polypeptide that is at least 
80% identical to a polypeptide selected from the group 
consisting of: a) a polypeptide having the sequence of 
amino acid residue 1 to amino acid residue 65 of SEQ ID 

30 NO: 2; b) a polypeptide having the sequence of amino acid 
residue 19 to amino acid residue 65 of SEQ ID NO:2; c) a 
polypeptide having the sequence of amino acid residue 21 
to amino acid residue 65 of SEQ ID NO: 2; d) a polypeptide 
having the sequence of amino acid residue 1 to amino acid 

35 residue 67 of SEQ ID NO: 10; e) a polypeptide having the 
sequence of amino acid residue 21 to amino acid residue 67 
of SEQ ID NO: 10; and f) a polypeptide having the sequence 
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of amino acid residue 23 to amino acid residue 67 of SEQ 
ID NO: 2; wherein the polypeptide has cysteine residues 
corresponding to amino acid residues 33, 40, 45, 55, 62 
and 63 of SEQ ID NOs : 2 or 10. Within one embodiment the 
5 protein comprises a polypeptide having the sequence 
selected from the group consisting of : a) a polypeptide 
having the sequence of amino acid residue 1 to amino acid 
residue 67 of SEQ ID NO: 10; b) a polypeptide having the 
sequence of amino acid residue 21 to amino acid residue 67 
10 of SEQ ID NO: 10; and c) a polypeptide having the sequence 
of amino acid residue 23 to amino acid residue 67 of SEQ 
ID NO: 10. 

Within another aspect is provided an isolated 
protein having the sequence of SEQ ID NO: 10 from amino 
15 acid residue 23 to amino acid residue 67. 

Within still another aspect is provided a 
pharmaceutical composition comprising a protein as 
described above in combination with a pharmaceutical ly 
acceptable vehicle. 
20 Within yet another aspect is provided an 

antibody that specifically binds to a protein as described 
above . 

Within a further aspect is provided an anti- 
idiotypic antibody of an antibody which specifically binds 

25 to a protein as described above. 

Within another aspect is provided an isolated 
polynucleotide molecule encoding a protein, the 
polynucleotide molecule consisting of a coding strand and 
a complementary non-coding strand, wherein the 

30 polynucleotide molecule encodes a polypeptide that is at 
least 80% identical to the amino acid sequence to a 
polypeptide selected from the group consisting of: a) a 
polypeptide having the sequence of amino acid residue 1 to 
amino acid residue 65 of SEQ ID N0:2; b) a polypeptide 

35 having the sequence of amino acid residue 19 to amino acid 
residue 65 of SEQ ID N0:2; c) a polypeptide having the 
sequence of amino acid residue 21 to amino acid residue 65 
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of SEQ ID N0:2; d) a polypeptide having the sequence of 
amino acid residue 1 to amino acid residue 67 of SEQ ID 
NO: 10; e) a polypeptide having the sequence of amino acid 
residue 21 to amino acid residue 67 of SEQ ID NO: 10; and 
5 f) a polypeptide having the sequence of amino acid residue 
23 to amino acid residue 67 of SEQ ID NO: 2; wherein the 
polypeptide has cysteine residues corresponding to amino 
acid residues 33, 40, 45, 55, 62 and 63 of SEQ ID NOs:2 or 
10. 

10 Within another aspect the invention provides an 

isolated polynucleotide molecule encoding a protein, having 
cysteine residues corresponding to amino acid residues 33, 
40, 45, 55, 62 and 63 of SEQ ID NO:10, the polynucleotide 
molecule consisting of a coding strand and a complementary 

15 non- coding strand, wherein the polynucleotide comprises a 
nucleotide sequence that is at least 80% identical to the 
sequence of a polynucleotide selected from the group 
consisting of: a) a polynucleotide as shown in SEQ ID NO: 9 
from nucleotide 220 to nucleotide 420; b) a polynucleotide 

20 as shown in SEQ ID NO: 9 from nucleotide 280 to nucleotide 
420; and c) a polynucleotide as shown in SEQ ID NO: 9 from 
nucleotide 286 to nucleotide 420. 

Within yet another aspect is provided an 
isolated polynucleotide molecule encoding a protein having 

25 cysteine residues corresponding to amino acid residues 33, 
40, 45, 55, 62 and 63 of SEQ ID N0:10, the polynucleotide 
molecule consisting of a coding strand and a complementary 
non-coding strand, wherein the polynucleotide comprises a 
nucleotide sequence as shown in SEQ ID NO: 11. 

30 Within still another aspect is provided an 

expression vector comprising the following operably linked 
elements: a transcription promoter; a DNA segment encoding 
a protein as described above; and a transcription 
terminator. Within one embodiment the DNA segment further 

35 encodes a secretory signal sequence operably linked to the 
protein. Within a related embodiment the secretory signal 
sequence is selected from the group consisting of: a) a 
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polypeptide having the sequence of amino acid residue 1 to 
amino acid residue 18 of SEQ ID NO: 2; b) a polypeptide 
having the sequence of amino acid residue 1 to amino acid 
residue 20 of SEQ ID NO: 2; c) a polypeptide having the 
5 sequence of amino acid residue 1 to amino acid residue 20 
of SEQ ID NO: 10; and d) a polypeptide having the sequence 
of amino acid residue 1 to amino acid residue 22 of SEQ ID 
NO: 10. 

Within another aspect the invention provides a 
10 cultured cell into which has been introduced an expression 
vector comprising the following operably linked elements: 
a transcription promoter; a DNA segment encoding a protein 
as described above; and a transcription terminator; 
wherein the cell expresses the protein encoded by the DNA 
15 segment. 

Within a further aspect is provided a method of 
producing a protein comprising: culturing a cell into 
which has been introduced an expression vector comprising 
the following operably linked elements: a transcription 

2 0 promoter; a DNA segment encoding a protein as described 
above; and a transcription terminator; whereby the cell 
expresses the protein encoded by the DNA segment; and 
recovering the expressed protein. 

The invention also provides an oligonucleotide 

25 probe or primer comprising at least 14 contiguous 
nucleotides of a polynucleotide of SEQ ID NO: 11 or a 
sequence complementary to SEQ ID NO: 11. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates the three disulfide bond 
structure of the conserved p-defensin motif. 

Figure 2 illustrates a multiple alignment of 
5 mature, processed human SAP-1 (see, for example, Bensch et 
al., FEBS Lett . 368 (2) : 331-5, 1995) and the zampl 
polypeptide of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

10 Prior to setting forth the invention in detail, 

it may be helpful to the understanding thereof to define 
the following terms. 

The term "affinity tag" is used herein to denote 
a peptide segment that can be attached to a polypeptide to 

15 provide for purification of the polypeptide or provide 
sites for attachment of the polypeptide to a substrate. 
In principal, any peptide or protein for which an antibody 
or other specific binding agent is available can be used 
as an affinity tag. Affinity tags include a poly- 

20 histidine tract, protein A (Nilsson et al., EMBO J. 
4:1075, 1985; Nilsson et al . , Methods Enzymol . 198:3, 
1991) , glutathione S transferase (Smith and Johnson, Gene 
£7:31, 1988), substance P, Flag™ peptide (Hopp et al . , 
Biotechnology 6: 1204 -1210, 1988; available from Eastman 

25 Kodak Co., New Haven, CT) , streptavidin binding peptide, 
or other antigenic epitope or binding domain. See, in 
general Ford et al . , Protein Expression and Purification 

2: 95-107, 1991. DNAs encoding affinity tags are 

available from commercial suppliers (e.g., Pharmacia 

30 Biotech, Piscataway, NJ) . 

The term "allelic variant" denotes any of two or 
more alternative forms of a gene occupying the same 
chromosomal locus. Allelic variation arises naturally 
through mutation, and may result in phenotypic 

35 polymorphism within populations. Gene mutations can be 
silent (no change in the encoded polypeptide) or may 
encode polypeptides having altered amino acid sequence. 
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The term allelic variant is also used herein to denote a 
protein encoded by an allelic variant of a gene. 

The terms "amino- terminal" and " carboxyl - 
terminal" are used herein to denote positions within 
5 polypeptides and proteins. Where the context allows, 
these terms are used with reference to a particular 
sequence or portion of a polypeptide or protein to denote 
proximity or relative position. For example, a certain 
sequence positioned carboxyl- terminal to a reference 
10 sequence within a protein is located proximal to the 
carboxyl terminus of the reference sequence, but is not 
necessarily at the carboxyl terminus of the complete 
protein. 

The term "complement/anti-complement pair" 

15 denotes non- identical moieties that form a non-covalently 
associated, stable pair under appropriate conditions . For 
instance, biotin and avidin (or streptavidin) are 
prototypical members of a complement /ant i -complement pair. 
Other exemplary complement/anti-complement pairs include 

20 receptor/ligand pairs, antibody/antigen (or hapten or 
epitope) pairs, sense/antisense polynucleotide pairs, and 
the like. Where subsequent dissociation of the 

complement /anti- complement pair is desirable, the 
complement/anti-complement pair preferably has a binding 

25 affinity of <10 9 M" 1 . 

The term "complements of polynucleotide 
molecules" denotes polynucleotide molecules having a 
complementary base sequence and reverse orientation as 
compared to a reference sequence. For example, the 

30 sequence 5 ' ATGCACGGG 3 1 is complementary to 5 ' CCCGTGCAT 
3 ' . 

The term "degenerate nucleotide sequence" 
denotes a sequence of nucleotides that includes one or 
more degenerate codons (as compared to a reference 
35 polynucleotide molecule that encodes a polypeptide) . 
Degenerate codons contain different triplets of 
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nucleotides, but encode the same amino acid residue (i.e., 
GAU and GAC triplets each encode Asp) . 

The term "expression vector" denotes a DNA 
molecule, linear or circular, that comprises a segment 
5 encoding a polypeptide of interest operably linked to 
additional segments that provide for its transcription. 
Such additional segments may include promoter and 
terminator sequences, and may optionally include one or 
more origins of replication, one or more selectable 

10 markers, an enhancer, a polyadenylation signal, and the 
like. Expression vectors are generally derived from 
plasmid or viral DNA, or may contain elements of both. 

The term "isolated", when applied to a 
polynucleotide molecule, denotes that the polynucleotide 

15 has been removed from its natural genetic milieu and is 
thus free of other extraneous or unwanted coding 
sequences, and is in a form suitable for use within 
genetically engineered protein production systems. Such 
isolated molecules are those that are separated from their 

20 natural environment and include cDNA and genomic clones. 
Isolated DNA molecules of the present invention are free 
of other genes with which they are ordinarily associated, 
but may include naturally occurring 5 ' and 3 ' untranslated 
regions such as promoters and terminators . The 

25 identification of associated regions will be evident to 
one of ordinary skill in the art (see for example, Dynan 
and Tijan, Nature 316:774-78, 1985). When applied to a 
protein, the term "isolated" indicates that the protein is 
found in a condition other than its native environment, 

3 0 such as apart from blood and animal tissue. In a 
preferred form, the isolated protein is substantially free 
of other proteins, particularly other proteins of animal 
origin. It is preferred to provide the protein in a 
highly purified form, i.e., greater than 95% pure, more 

35 preferably greater than 99% pure. 

The term "operably linked", when referring to 
DNA segments, denotes that the segments are arranged so 
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that they function in concert for their intended purposes, 
e.g. transcription initiates in the promoter and proceeds 
through the coding segment to the terminator. 

The term "ortholog" denotes a polypeptide or 
5 protein obtained from one species that is the functional 
counterpart of a polypeptide or protein from a different 
species. Sequence differences among orthologs are the 
result of speciation. 

The term "ortholog" denotes a polypeptide or 

10 protein obtained from one species that is the functional 
counterpart of a polypeptide or protein from a different 
species. Sequence differences among orthologs are the 
result of speciation. 

The term "polynucleotide" denotes a single- or 

15 double- stranded polymer of deoxyribonucleotide or 
ribonucleotide bases read from the 5' to the 3' end. 
Polynucleotides include RNA and DNA, and may be isolated 
from natural sources, synthesized in vitro, or prepared 
from a combination of natural and synthetic molecules. 

2 0 Sizes of polynucleotides are expressed as base pairs 
(abbreviated "bp"), nucleotides ("nt"), or kilobases 
("kb") . Where the context allows, the latter two terms 
may describe polynucleotides that are single-stranded or 
double -stranded. When the term is applied to double - 

25 stranded molecules it is used to denote overall length and 
will be understood to be equivalent to the term "base 
pairs". It will be recognized by those skilled in the art 
that the two strands of a double- stranded polynucleotide 
may differ slightly in length and that the ends thereof 

30 may be staggered as a result of enzymatic cleavage; thus 
all nucleotides within a double- stranded polynucleotide 
molecule may not be paired. Such unpaired ends will in 
general not exceed 20 nt in length. 

A "polypeptide" is a polymer of amino acid 

35 residues joined by peptide bonds, whether produced 
naturally or synthetically. Polypeptides of less than 



WO 99/13080 



PCT/US98/19222 



11 

about 10 amino acid residues are commonly referred to as 
"peptides" . 

The term "promoter" denotes a portion of a gene 
containing DNA sequences that provide for the binding of 
5 RNA polymerase and initiation of transcription. Promoter 
sequences are commonly, but not always, found in the 5' 
non-coding regions of genes. 

The term "receptor" denotes a cell-associated 
protein that binds to a bioactive molecule (i.e., a 

10 ligand) and mediates the effect of the ligand on the cell. 
Membrane -bound receptors are characterized by a multi- 
domain structure comprising an extracellular ligand- 
binding domain and an intracellular effector domain that 
is typically involved in signal transduction. Binding of 

15 ligand to receptor results in a conformational change in 
the receptor that causes an interaction between the 
effector domain and other molecule (s) in the cell. This 
interaction in turn leads to an alteration in the 
metabolism of the cell. Metabolic events that are linked 

20 to receptor- ligand interactions include gene 

transcription, phosphorylation, dephosphorylation, 

increases in cyclic AMP production, mobilization of 
cellular calcium, mobilization of membrane lipids, cell 
adhesion, hydrolysis of inositol lipids and hydrolysis of 

25 phospholipids. Most nuclear receptors also exhibit a 
multi -domain structure, including an amino- terminal, 
trans activating domain, a DNA binding domain and a ligand 
binding domain. In general, receptors can be membrane 
bound, cytosolic or nuclear; monomeric (e.g., thyroid 

30 stimulating hormone receptor, beta-adrenergic receptor) or 
multimeric (e.g., PDGF receptor, growth hormone receptor, 
IL-3 receptor, GM-CSF receptor, G-CSF receptor, 
erythropoietin receptor and IL-6 receptor) . 

The term "secretory signal sequence" denotes a 

35 DNA sequence that encodes a polypeptide (a "secretory 
peptide") that, as a component of a larger polypeptide, 
directs the larger polypeptide through a secretory pathway 
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of a cell in which it is synthesized. The larger peptide 
is commonly cleaved to remove the secretory peptide during 
transit through the secretory pathway. 

A "soluble receptor" is a receptor polypeptide 
5 that is not bound to a cell membrane. Soluble receptors 
are most commonly ligand-binding receptor polypeptides 
that lack transmembrane and cytoplasmic domains . Soluble 
receptors can comprise additional amino acid residues, 
such as affinity tags that provide for purification of the 
10 polypeptide or provide sites for attachment of the 
polypeptide to a substrate, or immunoglobulin constant 
region sequences. Many cell -surface receptors have 

naturally occurring, soluble counterparts that are 
produced by proteolysis or translated from alternatively 
15 spliced mRNAs. Receptor polypeptides are said to be 
substantially free of transmembrane and intracellular 
polypeptide segments when they lack sufficient portions of 
these segments to provide membrane anchoring or signal 
transduction, respectively. 
20 Molecular weights and lengths of polymers 

determined by imprecise analytical methods (e.g., gel 
electrophoresis) will be understood to be approximate 
values. When such a value is expressed as "about" X or 
"approximately" X, the stated value of X will be 
25 understood to be accurate to ±10%. 

All references cited herein are incorporated by 
reference in their entirety. 

The present invention is based in part upon the 
discovery of a novel DNA sequence that encodes a 
30 polypeptide having homology to proteins of the P-defensin 
family. That is, the zampl polypeptides of the present 
invention exhibit a conserved motif shown in SEQ ID NO: 3 
and herein: C (X) 6 C (X) ,C (X) -,GXC (X) 6 CC wherein "(X)" is the 
number of preferably non-cysteine amino acid residues 
35 between specific amino acids. The cysteine position and 
spacing is characteristic of the P-defensin family. In 
addition, the QIG tripeptide motif embedded in the 
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conserved motif shown in SEQ ID NO: 3 occurs in several 
members of the P-defensin family (for example, SAP-l/hBD- 
2, BNBDs, TAP and the like) . This motif is interpreted to 
indicate the presence of three disulfide bonds in the (3- 
5 defensin structure. Those disulfide bonds are shown in 
Fig. 1. In addition, an intron sequence of approximately 
900 base pairs is found in genomic DNA sequence encoding 
the zampl polypeptide. This intron sequence is inserted 
between the two guanine residues in the codon encoding the 

10 glycine residue at amino acid position 20 in SEQ ID NO: 
10. Such intron placement, in the area between the signal 
sequence and the mature protein occurs in other members of 
the (3-defensin family. 

A standard Northern blot tissue distribution of 

15 the mRNA corresponding to this novel DNA revealed no 
expression. It thus appears that normal tissue levels of 
mRNA of zampl polypeptide are below the detection 
sensitivity of the Northern blot. Such an observation is 
consistent with the knowledge in the art regarding 

20 defensins, i.e. , that they are constitutively expressed at 
low levels but are highly inducible upon infection. 
Electronic analysis of tissue distribution based upon 
libraries where the sequence is found indicate that zampl 
polypeptide is expressed in bronchial epithelia. 

25 The novel zampl polypeptides of the present 

invention were initially identified by querying an EST 
database for homologous sequences to the SAP-1 human 
defensin isolated from human psoriatic skin. A single EST 
sequence was discovered in a bronchial epithelium cDNA 

30 library and was predicted to be related to the P-defensin 
family. A second search based upon the P-defensin 
consensus motif also identified the EST. Thus, the 
consensus motif is found in the zampl polypeptide as well 
as in the SAP-1 protein; however, the remaining sequence 

35 of the two proteins is divergent, characterized by 
approximately 43% identity at the amino acid level. See, 
for example, the multiple alignment shown in Fig. 2. 
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The nucleotide sequence of the zampl polypeptide 
is described in SEQ ID NO: 1 and SEQ ID NO: 9, and its 
deduced amino acid sequence is described in SEQ ID NO: 2 
and SEQ ID NO: 10, respectively. The zampl polypeptide, 
5 by sequence analysis, can be grouped with the two human P~ 
defensins, hBD-1 and hBD-2 (SAP-l) , but it is most closely 
sequence -related to hBD-2 and the bovine BNBDs and less 
similar to hBD-1. 

Preliminary computer-aided model building 

10 efforts to construct a three-dimensional model structure 
for zampl polypeptide indicate that it is feasible to 
generate physically reasonable model structures using 
BNBD_12 (Zimmermann et al . , Biochemistry 34 (41) : 13663- 
13671, 1995) as a template. Although there is relatively 

15 low sequence identity between these two peptides, their 
overall secondary structure is very similar. The most 
variability is observed in the loop regions, which is not 
alarming since the loop segments represent one of many 
possible conformations for each loop. Both structures are 

20 built primarily of an anti-parallel beta sheet core, four- 
stranded in the BNBD_12 model and three- stranded in the 
zampl polypeptide model . A turn formed between two of the 
beta strands in the BNBD_12 chain is also found in the 
zampl polypeptide model connecting two beta strands. The 

25 overall folding of BNBD_12 follows the pattern of beta 
strand/short beta strand/short beta strand/turn/beta 
strand. Folding of zampl polypeptide consists of beta 
strand/beta strand/turn/beta strand. These common 

structural elements are highly superimposable . Thus, the 
3 0 two polypeptides may well be involved in the same or 
similar biological processes. 

Another aspect of the present invention includes 
zampl polypeptide fragments . Preferred fragments include 
the leader sequence, ranging from amino acid 1 (lie) to 
35 amino acid 18 (Gly) or 20 (Gly) of SEQ ID NO: 2 and 
ranging from amino acid 1 (Met) to amino acid 20 (Gly) or 
22 (Gly) of SEQ ID NO: 10. Such leader sequences may be 
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used to direct the secretion of other polypeptides. Such 
fragments of the present invention may be used as follows: 
the alternative secretion leader fragments are formed as 
fusion proteins with alternative proteins selected for 
5 secretion; plasmids bearing regulatory regions capable of 
directing the expression of the fusion protein are 
introduced into test cells; and secretion of the protein 
is monitored. 

The present invention also provides fusion 

10 constructs incorporating the zampl polypeptide selected 
from the group consisting of: (a) polypeptide molecules 
comprising a sequence of amino acid residues as shown in 
SEQ ID N0:2 from amino acid residue 1 (lie), 19 (His) or 
21 (Gly) to amino acid residue 65 (Lys) or a sequence of 

15 amino acid residues as shown in SEQ ID NO: 10 from amino 
acid residue 1 (Met) , 21 (His) or 23 (Gly) to amino acid 
residue 67 (Lys) ; or (b) mammalian species homologs or 
human paralogs of (a) ; at least the mature polypeptide 
region of another defensin molecule; and, optionally, a 

20 polypeptide linker there between. When defensin molecules 
having disparate spectrum of pathogens, fusion constructs 
containing the same are expected to exhibit a broader 
range of anti-microbial effectiveness. Polypeptide 
linkers are preferably employed if necessary to provide 

25 separation of component polypeptides of the fusion or to 
allow for flexibility of the fusion protein, thereby 
preserving the anti-microbial activity of each defensin 
component of the fusion protein. Those of ordinary skill 
in the art are capable of designing such linkers. 

30 The highly conserved amino acids in the 

consensus domain of zampl polypeptide can be used as a 
tool to identify new family members. For instance, 
reverse transcription-polymerase chain reaction (RT-PCR) 
can be used to amplify sequences encoding the conserved 

35 motif from RNA obtained from a variety of tissue sources. 
More specifically, the following probes can be employed to 
identify other human or zampl- like P-defensins. A 
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preferred embodiment of this aspect of the present 
invention ranges between amino acid residues 31 and 61 of 
SEQ ID NO: 2 (corresponding to nucleotides 91-183 of SEQ 
ID NO: 1) . In particular, highly degenerate primers 
5 designed from the above sequences are useful for this 
purpose . 

SEQ ID NO: 4 is a degenerate polynucleotide 
sequence that encompasses all polynucleotides that encode 
the zampl polypeptide of SEQ ID NO: 2 (amino acids 1-65). 
It SEC ID NO: 11 is a degenerate polynucleotide sequence that 
encompasses all polynucleotides that encode the zampl 
polypeptide of SEQ ID NO: 10. Thus, zampl polypeptide- 
encoding polynucleotides ranging from nucleotide l, 61 or 
€■? to nucleotide 195 or 213 of SEQ ID NO: 4 and ranging 
15 from nucleotide 1, 61 or 67 to nucleotide 201 or 219 of 
SEQ ID NO: 11 are contemplated by the present invention. 
Also contemplated by the present invention are fragments 
and fusions as described above with respect to SEQ ID NO: 
1 and SEQ ID NO: 10, which are formed from analogous 
regions of SEQ ID NO: 4 and SEQ ID NO: ll. The symbols in 
SEQ ID NO: 4 are summarized in Table l below. 



20 
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TABLE 1 

Complement Resolutions 



T 


T 


G 


G 


C 


C 


A 


A 


Y 


C|T 


R 


A|G 


K 


G|T 


M 


A|C 


S 


A|T 


W 


C|G 


D 


A|G|T 


V 


A|C|G 


B 


C|G[T 


H 


A|C|T 


N 


A|C|G|T 



5 The degenerate codons used in SEQ ID NO: 4 and 

SEQ ID NO: 11, encompassing all possible codons for a 
given amino acid, are set forth in Table 2 below. 
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TABLE 2 



Amino 


Letter 


Codons 


Degenerate 


Acid 






Codon 


Cys 


C 


TGC TGT 


TGY 


Ser 


S 


AGC AGT TCA TCC TCG TCT 


WSN 


Thr 


T 


ACA ACC ACG ACT 


ACN 


Pro 


P 


CCA CCC CCG CCT 


CCN 


Ala 


A 


GCA GCC GCG GCT 


GCN 


Gly 


G 


GGA GGC GGG GGT 


GGN 


Asn 


N 


AAC AAT 


AAY 


Asp 


D 


GAC GAT 


GAY 


Glu 


E 


GAA GAG 


GAR 


Gin 


Q 


CAA CAG 


CAR 


His 


H 


CAC CAT 


CAY 


Arg 


R 


AGA AGG CGA CGC CGG CGT 


MGN 


Lys 


K 


AAA AAG 


AAR 


Met 


M 


ATG 


ATG 


He 


I 


ATA ATC ATT 


ATH 


Leu 


L 


CTA CTC CTG CTT TTA TTG 


YTN 


Val 


V 


GTA GTC GTG GTT 


GTN 


Phe 


F 


TTC TTT 


TTY 


Tyr 


Y 


TAC TAT 


TAY 


Trp 


W 


TGG 


TGG 


Ter 




TAA TAG TGA 


TRR 


Asn | Asp 


B 




RAY 


Glu|Gln 


Z 




SAR 


Any 


X 




NNN 


Gap 









One of ordinary skill in the art will appreciate 
5 that some ambiguity is introduced in determining a 
degenerate codon, representative of all possible codons 
encoding each amino acid. For example, the degenerate 
codon for serine (WSN) can, in some circumstances, encode 
arginine (AGR) , and the degenerate codon for arginine 
10 (MGN) can, in some circumstances, encode serine (AGY) . A 
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similar relationship exists between codons encoding 
phenylalanine and leucine. Thus, some polynucleotides 
encompassed by the degenerate sequence may encode variant 
amino acid sequences, but one of ordinary skill in the art 
5 can easily identify such variant sequences by reference to 
the amino acid sequence of SEQ ID NO: 2 and SEQ ID NO: 10. 
Variant sequences can be readily tested for functionality 
as described herein. 

Within another aspect of the present invention 

10 there is provided a pharmaceutical composition comprising 
purified zampl polypeptide in combination with a 
pharmaceutically acceptable vehicle. Such pharmaceutical 
compositions are used in the treatment of conditions 
associated with pathological microbes, including 

15 bacterial, fungal and viral infections. Antibacterial 
applications of zampl polypeptide include situations where 
the pathogen has become resistant to standard treatments. 
For example, hospital sepsis is an increasing problem, 
since Staphylococcus strains have become resistant to 

20 commonly used antibiotics. 

In general, ant i -microbial activity of zampl 
polypeptides, fragments, fusions, antibodies, agonists and 
antagonists can be evaluated by techniques that are known 
in the art. More specifically, ant i -microbial activity 

25 can be assayed by evaluating the sensitivity of microbial 
cell cultures to test agents and by evaluating the 
protective effect of test agents on infected mice. See, 
for example, Musiek et al., Antimicrob. Agents Chemothr. 
3.: 40, 1973. Antiviral activity can also be assessed by 

30 protection of mammalian cell cultures. Known techniques 
for evaluating ant i -microbial activity include, for 
example, Barsum et al . , Eur. Respir. J. 8 (5) : 709-14, 
1995; Sandovsky-Losica et al . , J. Med. Vet. Mvcol 
(England) 28 (4) : 279-87, 1990; Mehentee et al., J. Gen. 

35 Microbiol (England) 135 (Pt . 8) : 2181-8, 1989; Segal and 
Savage , Journal of Medical and Veterinary Mycology 24 : 
477-479, 1986 and the like. Known assays specific for 
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anti-viral activity include, for example, those described 
by Daher et al., J. Virol. 60 (3) : 1068-74, 1986. 

In addition, contract laboratories offer 
services in evaluating ant i -microbial properties. For 
5 example, Panlabs, Inc. of Bothell, Washington offer in 
vitro or in vivo testing for bacteria, gram negative 
(Enterobacter cloacae, Escherichia coli, Klebsiella 
pneumonia, Proteus vulgaris, Pseudomonas aeruginosa, 
Salmonella typhimurium and Serratia marcescens) , gram 

10 positive {Bacillus subtilis, Brevebacterium ammoniagenes , 
Corynebacterium minutissimum, Micrococcus luteus, 
Mycobacterium ranae, Staphylococcus strains and 
Streptococcus strains) and anaerobic organisms 
(Actinomyces viscosus, Bacteroides fragilis , Clostridium 

15 sporogenes, Corynebacterium acnes, Helicobacter pylori and 
Porphyromonas gingivalis) , as well as for protozoa 
(Trichomonas foetus) and fungi ( e.g. . Candida albicans , 
Epidermophyton floccosum, Exophiala jeanselmei , 

Microsporum strains, Trichophyton strains and the like) . 

20 Also, Molecular Probes of Oregon has commercially 
available fluorescence technology for use in bacteriology. 

If desired, zampl polypeptide, fragment, fusion 
protein, agonist, antagonist or antibody performance in 
this regard can be compared to proteins known to be 

25 functional in this regard, such as proline-rich proteins, 
lysozyme, histatins, lactoperoxidase or the like. In 
addition, zampl polypeptide, fragment, fusion protein, 
antibody, agonist or antagonist may be evaluated in 
combination with one or more ant i -microbial agents to 

30 identify synergistic effects. 

The pharmaceutical compositions of the present 
invention may also be used when pro- inflammatory activity 
is desired. Applications for such pro- inflammatory 
activity include the treatment of chronic tissue damage, 

35 particularly in areas having a limited or damaged vascular 
system, e.g. , damage in extremities associated with 
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diabetes. In contrast, antagonists to zampl polypeptides 
may be useful as ant i- inflammatory agents. 

Zampl polypeptide pharmaceutical compositions of 
the present invention may also be used in the treatment of 
5 conditions where stimulation of immune responsiveness is 
desired. Such conditions include the treatment of 
patients having incompetent immune systems, such as AIDs 
patients or individuals that have undergone chemotherapy, 
radiation treatment or the like. 

10 Because zampl polypeptide was found in a 

bronchial epithelia library and cystic fibrosis is 
characterized by frequent microbial infection, 
pharmaceutical compositions containing zampl polypeptide 
are also contemplated for use in the treatment of lung 

15 infections associated with cystic fibrosis. Also 
contemplated by the present invention are engineered zampl 
polypeptides that are characterized by decreased 
sensitivity to salt concentration. Decreased sensitivity 
to high salt concentration will preserve ant i -microbial 

20 activity of engineered zampl polypeptides in high salt 
environments, such as in the lung airways of patients 
suffering from cystic fibrosis. In this manner, 

pharmaceutical compositions containing engineered zampl 
polypeptides that are formulated for delivery to the lungs 

25 can be used to treat lung infections associated with 
cystic fibrosis. 

Radiation hybrid mapping is a somatic cell 
genetic technique developed for constructing high- 
resolution, contiguous maps of mammalian chromosomes (Cox 

30 et al., Science 250:245-50, 1990). Partial or full 
knowledge of a gene's sequence allows one to design PCR 
primers suitable for use with chromosomal radiation hybrid 
mapping panels. Commercially available radiation hybrid 
mapping panels which cover the entire human genome, such 

35 as the Stanford G3 RH Panel and the GeneBridge 4 RH Panel 
(Research Genetics, Inc., Huntsville, AL) , are available. 
These panels enable rapid, PCR-based chromosomal 
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localizations and ordering of genes, sequence -tagged sites 
(STSs) , and other nonpolymorphic and polymorphic markers 
within a region of interest. This includes establishing 
directly proportional physical distances between newly 
5 discovered genes of interest and previously mapped 
markers. The precise knowledge of a gene's position can 
be useful for a number of purposes, including: 1) 
determining if a sequence is part of an existing contig 
and obtaining additional surrounding genetic sequences in 

10 various forms, such as YACs, BACs or cDNA clones; 2) 
providing a possible candidate gene for an inheritable 
disease which shows linkage to the same chromosomal 
region; and 3) cross-referencing model organisms, such as 
mouse, which may aid in determining what function a 

15 particular gene might have. 

The results showed that the zampl gene maps 33.5 
cR_3000 from the top of the human chromosome 8 linkage 
group on the WICGR radiation hybrid map. Proximal and 
distal framework markers were CHLC. GATA62D10 and WI-3823 

20 (D8S1511) , respectively. The use of surrounding markers 
positions the zampl gene in the 8p23.3-p23.2 region on the 
integrated LDB chromosome 8 map (The Genetic Location 
Database, University of Southhampton, WWW server: 
http: //cedar. genetics, soton.ac.uk/ public_html/) . 

25 Previously, human defensin genes of both 

hematopoietic (such as HD-1, as described by Sparkes et 
al.. Genomics 5 (2) : 240-4, 1989) and epithelial (such as 
HD-5 and HD-6, as described by Bevins et al . , Genomics 
31 (1) : 95-106, 1996) origin are localized on the short arm 

30 of human chromosome 8 (8p23) . Several defensin genes, 
crypt idins, have been mapped in the mouse genome and are 
found in a region of conserved synteny with human on mouse 
chromosome 8. See, for example, Ouellette et al . , 
Genomics 5(2) : 233-9, 1989. Recently, Liu et al . , 

35 Genomics 43 (3) : 316-20, 1997, reported the mapping of the 
hBD_l gene to the same cluster of defensins on chromosome 
8. These authors propose that a- and p-defensin genes 
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arose from a common ancestral gene prior to mammalian 
divergence. Thus, the localization of the zampl 

polypeptide -encoding gene to this region of chromosome 8 
adds a second human fi-defensin to the same chromosomal 
5 location as the human classic defensins and supports the 
hypothesis for the evolution of defensins. 

The present invention also provides reagents 
which will find use in diagnostic applications. For 
example, the zampl gene, a probe comprising zampl DNA or 

10 RNA or a subsequence thereof can be used to determine if 
the zampl gene is present on chromosome 8 or if a mutation 
has occurred. Detectable chromosomal aberrations at the 
zampl gene locus include but are not limited to 
aneuploidy, gene copy number changes, insertions, 

15 deletions, restriction site changes and rearrangements. 
Such aberrations can be detected using polynucleotides of 
the present invention by employing molecular genetic 
techniques, such as restriction fragment length 
polymorphism (RFLP) analysis, short tandem repeat (STR) 

20 analysis employing PCR techniques, and other genetic 

linkage analysis techniques known in the art (Sambrook et 

al., ibid . ; Ausubel, et . al . , ibid . ; Marian, Chest , 108 : 

255-265, 1995) . 

Another aspect of the present invention involves 

25 the detection of zampl polypeptides in cell culture or in 
a serum sample or tissue biopsy of a patient undergoing 
evaluation for SPG, Chediak-Higashi syndrome or other 
conditions characterized by an alteration in defensin 
concentration. Zampl polypeptides can be detected using 

30 immunoassay techniques and antibodies capable of 
recognizing a zampl polypeptide epitope. More 
specifically, the present invention contemplates methods 
for detecting zampl polypeptide comprising: 

exposing a solution or sample or cell culture 

35 lysate or supernatant, possibly containing zampl 
polypeptide, to an antibody attached to a solid support, 
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wherein said antibody binds to a first epitope of a zatnpl 
polypeptide ; 

washing said immobilized antibody- polypeptide to 
remove unbound contaminants; 
5 exposing the immobilized antibody-polypeptide to 

a second antibody directed to a second epitope of a zampl 
polypeptide, wherein the second antibody is associated 
with a detectable label; and 

detecting the detectable label. Zampl 
•10 polypeptide concentration differing from that of controls 
may be indicative of SPG, Chediak-Higashi syndrome or 
other conditions characterized by an alteration in 
defensin concentration. In addition, expression of zampl 
may be monitored in cystic fibrosis patients as a 

15 predictor of the onset of infectious crises. Also, high 
defensin, such as zampl polypeptide, levels have been 
associated with cytotoxic effects in lung, indicating that 
zampl polypeptide levels can be used to direct treatment 
for averting or addressing such cytotoxicity. For 

20 example, antibodies directed to zampl polypeptide can be 
administered to inactivate the same in a treatment 
modality. 

Within additional aspects of the invention there 
are provided antibodies or synthesized binding 
25 proteins ( e.g. . those generated by phage display, E. coli 

Fab, and the like) that specifically bind to the zampl 
polypeptides described above. Such antibodies are useful 
for, among other uses as described herein, preparation of 
ant i- idiotypic antibodies. Synthesized binding proteins 

3 0 may be produced by phage display using commercially 
available kits, such as the Ph.D.™ Phage Display Peptide 
Library Kits available from New England Biolabs, Inc. 
(Beverly, Massachusetts) . Phage display techniques are 
described, for example, in US Patent Nos . 5,223,409, 

35 5,403,484 and 5,571,698. 

An additional aspect of the present invention 
provides methods for identifying agonists or antagonists 
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of the zampl polypeptides disclosed above, which agonists 
or antagonists may have valuable properties as discussed 
further herein. Within one embodiment, there is provided 
a method of identifying zampl polypeptide agonists, 
5 comprising providing cells responsive thereto, culturing 
the cells in the presence of a test compound and comparing 
the cellular response with the cell cultured in the 
presence of the zampl polypeptide, and selecting the test 
compounds for which the cellular response is of the same 
10 type. 

Within another embodiment, there is provided a 
method of identifying antagonists of zampl polypeptide, 
comprising providing cells responsive to a zampl 
polypeptide, culturing a first portion of the cells in the 

15 presence of zampl polypeptide, culturing a second portion 
of the cells in the presence of the zampl polypeptide and 
a test compound, and detecting a decrease in a cellular 
response of the second portion of the cells as compared to 
the first portion of the cells. 

20 A further aspect of the invention provides a 

method of studying chemoattraction of monocytes in cell 
culture, comprising incubating monocytes in a culture 
medium comprising a zampl polypeptide, fragment, fusion 
protein, antibody, agonist or antagonist to study or 

25 evaluate monocyte chemoattraction. Such evaluation may be 
conducted using methods known in the art, such as those 
described by Territo et al. referenced above. 

Melanocortin receptors are G- coupled protein 
receptors which activate adenylate cyclase and cause 

30 calcium flux. The agouti protein (which contains a 36 
amino acid domain that is toxin- like) is thought to 
inhibit the binding of MSH-alpha to MCI and MC4 . In 
addition, the agouti protein is thought to be an 
antagonist of calcium channels, and certain toxins are 

3 5 believed to modulate ion flux. Experimental evidence has 
been generated, suggesting that defensins are capable of 
blocking calcium channels . 
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A further aspect of the invention provides a 
method of studying activity of the melanocortin family of 
receptors in cell culture, comprising incubating cells 
that endogenously bear such receptors ( e.g. . ACTH 
5 receptors or the like) or cells that have been engineered 
to bear such receptors in a culture medium comprising a 
ligand or putative ligand and zampl polypeptide, fragment, 
fusion protein, antibody, agonist or antagonist to study 
or evaluate ligand or putative ligand binding and/or ion 

10 flux regulation or modulation. Such evaluation may be 
conducted using methods known in the art, such as those 
described by Zhu et al. referenced above. 

A further aspect of the invention provides a 
method of studying ion flux in cell culture, comprising 

15 incubating cells that are capable of ion flux, such as 
calcium flux, sodium flux, potassium flux or the like, in 
a culture medium comprising zampl polypeptide, fragment, 
fusion protein, antibody, agonist or antagonist to study 
or evaluate ion flux regulation or modulation. 

20 A further aspect of the invention provides a 

method of studying cytocidal activity against mammalian 
cells, such as tumor cells, in cell culture, comprising 
incubating such cells in a culture medium comprising a 
zampl polypeptide, fragment, fusion protein, antibody, 

25 agonist or antagonist at high test agent and low cell 
concentration to study or evaluate cytocidal activity. 
Such evaluation may be conducted using methods known in 
the art, such as those described by Lichtenstein et al.. 
Blood 68 : 1407-10, 1986 and Sheu et al., Antimicrob. 

30 Agents Chemother. 28 : 626-9, 1993. 

Another aspect of the present invention involves 
the use of zampl polypeptides, fragments, fusion proteins 
or agonists as cell culture reagents in in vitro studies 
of exogenous microorganism infection, such as bacterial, 

35 viral or fungal infection. Such moieties may also be used 
in in vivo animal models of infection. 
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An additional aspect of the present invention is 
to study epithelial cell defensin induction in cell 
culture. In this aspect of the present invention, 
epithelial cells are cultured and exposed to pathogenic 
5 stimuli. Induction of zampl polypeptide production by the 
epithelial cells is then measured. 

Within preferred embodiments of the invention 
the isolated polynucleotides will hybridize to similar 
sized regions of SEQ ID N0:2, SEQ ID N0:3, SEQ ID NO: 10, 

10 other probe sequences specifically set forth herein, or a 
sequence complementary thereto, under stringent 
conditions. In general, stringent conditions are selected 
to be about 5°C lower than the thermal melting point (T m ) 
for the specific sequence at a defined ionic strength and 

15 pH. The T m is the temperature (under defined ionic 
strength and pH) at which 50% of the target sequence 
hybridizes to a perfectly matched probe. Typical 
stringent conditions are those in which the salt 
concentration is up to about 0.03 M at pH 7 and the 

20 temperature is at least about 60°C. 

As previously noted, the isolated 

polynucleotides of the present invention include DNA and 
RNA. Methods for isolating DNA and RNA are well known in 
the art. It is generally preferred to isolate RNA from 

25 bronchial epithelium, although DNA can also be prepared 
using RNA from other tissues or isolated as genomic DNA. 
Total RNA can be prepared using guanidine HC1 extraction 
followed by isolation by centrifugation in a CsCl gradient 
(Chirgwin et al., Biochemistry 18:52-94, 1979). Poly (A) + 

30 RNA is prepared from total RNA using the method of Aviv 
and Leder ( Proc. Natl. Acad. Sci . USA 69 :1408-1412. 1972). 
Complementary DNA (cDNA) is prepared from poly (A) + RNA 
using known methods. Polynucleotides encoding zampl 
polypeptides are then identified and isolated by, for 

35 example, hybridization or PCR. 

The present invention further provides 
counterpart polypeptides and polynucleotides from other 
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species (orthologs) . These species include, but are not 
limited to mammalian, avian, amphibian, reptile, fish, 
insect and other vertebrate and invertebrate species. Of 
particular interest are zampl polypeptides from other 
5 mammalian species, including murine, rat, porcine, ovine, 
bovine, canine, feline, equine and other primate proteins. 
Species homologs of the human proteins can be cloned using 
information and compositions provided by the present 
invention in combination with conventional cloning 

10 techniques. For example, a cDNA can be cloned using mRNA 
obtained from a tissue or cell type that expresses the 
protein. Suitable sources of mRNA can be identified by 
probing Northern blots with probes designed from the 
sequences disclosed herein. A library is then prepared 

15 from mRNA of a positive tissue of cell line. A zampl 
polypeptide -encoding cDNA can then be isolated by a 
variety of methods, such as by probing with a complete or 
partial human cDNA or with one or more sets of degenerate 
probes based on the disclosed sequences. A cDNA can also 

20 be cloned using the polymerase chain reaction, or PCR 
(Mullis, U.S. Patent 4,683,202), using primers designed 
from the sequences disclosed herein. Within an additional 
method, the cDNA library can be used to transform or 
transfect host cells, and expression of the cDNA of 

25 interest can be detected with an antibody to zampl 
polypeptide. Similar techniques can also be applied to 
the isolation of genomic clones. 

Those skilled in the art will recognize that the 
sequences disclosed in SEQ ID NO:l, SEQ ID NO: 2, SEQ ID 

30 NO: 9 and SEQ ID NO: 10 represent a single allele of the 
human zampl gene and polypeptide, and that allelic 
variation and alternative splicing are expected to occur. 
Allelic variants can be cloned by probing cDNA or genomic 
libraries from different individuals according to standard 

35 procedures. Allelic variants of the DNA sequence shown in 
SEQ ID NO: 2 and SEQ ID NO: 10, including those containing 
silent mutations and those in which mutations result in 
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amino acid sequence changes, are within the scope of the 
present invention. 

The present invention also provides isolated 
zampl polypeptides that are substantially homologous to 
5 the polypeptides of SEQ ID NO: 2 and SEQ ID NO: 10 and their 
species homologs/ orthologs. The term "substantially 
homologous" is used herein to denote polypeptides having 
50%, preferably 60%, more preferably at least 80%, 
sequence identity to the sequences shown in SEQ ID NO: 2 or 

10 SEQ ID NO: 10 or their orthologs or paralogs. Such 
polypeptides will more preferably be at least 90% 
identical, and most preferably 95% or more identical to 
SEQ ID NO: 2 or SEQ ID NO: 10 or its orthologs or paralogs. 
Percent sequence identity is determined by conventional 

15 methods. See, for example, Altschul et al., Bull. Math. 
Bio. 48.: 603-616, 1986 and Henikoff and Henikoff, Proc. 
Natl. Acad. Sci. USA 89=10915-10919, 1992. Briefly, two 
amino acid sequences are aligned to optimize the alignment 
scores using a gap opening penalty of 10, a gap extension 

20 penalty of 1, and the "blosum 62" scoring matrix of 

Henikoff and Henikoff (ibid.) as shown in Table 3 (amino 

acids are indicated by the standard one -letter codes) . 

The percent identity is then calculated as: 

Total number of identical matches 
25 x 100 

[length of the longer sequence plus the 

number of gaps introduced into the longer 

sequence in order to align the two 

sequences] 
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Sequence identity of polynucleotide molecules is 
determined by similar methods using a ratio as disclosed 
above . 

Substantially homologous proteins and 

5 polypeptides are characterized as having one or more amino 
acid substitutions, deletions or additions. These changes 
are preferably of a minor nature, that is conservative 
amino acid substitutions (see Table 4) and other 
substitutions that do not significantly affect the folding 

10 or activity of the protein or polypeptide; small deletions, 
typically of one to about 30 amino acids; and small amino- 
or carboxyl- terminal extensions, such as an amino- terminal 
methionine residue, a small linker peptide of up to about 
20-25 residues, or a small extension that facilitates 

15 purification (an affinity tag) , such as a poly-histidine 
tract, protein A (Nilsson et al., EMBO J . 4.:1075, 1985; 
Nilsson et al . , Methods Enzvmol . 198=3, 1991), glutathione 
S transferase (Smith and Johnson, Gene 67:31, 1988), 
maltose binding protein (Kellerman and Ferenci, Methods 

20 Enzvmol . 90:459-463, 1982; Guan et al . , Gene 67:21-30, 
1987) , thioredoxin, ubiquitin, cellulose binding protein, 
T7 polymerase, or other antigenic epitope or binding 
domain. See, in general Ford et al., Protein Expression 
and Purification 2: 95-107, 1991. DNAs encoding affinity 

25 tags are available from commercial suppliers (e.g., 
Pharmacia Biotech, Piscataway, NJ; New England Biolabs, 
Beverly, MA) . Polypeptides comprising affinity tags can 
further comprise a proteolytic cleavage site between the 
zampl polypeptide and the affinity tag. Preferred such 

30 sites include thrombin cleavage sites and factor Xa 
cleavage sites . 
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Table 4 

Conservative amino acid substitutions 



10 



15 



Basic: 

Acidic : 
Polar: 

Hydrophobic : 
Aromatic : 
Small : 



20 



arginme 

lysine 

histidine 

glutamic acid 

aspartic acid 

glutamine 

asparagine 

leucine 

isoleucine 

valine 

phenylalanine 

tryptophan 

tyrosine 

glycine 

alanine 

serine 

threonine 

methionine 



The proteins of the present invention can also 
comprise, in addition to the 20 standard amino acids, non- 
naturally occurring amino acid residues. Non-naturally 
25 occurring amino acids include, without limitation, trans-3- 
methylproline, 2 , 4-methanoproline, cis-4-hydroxyproline, 
trans-4-hydroxyproline, N-methyl- glycine, alio- threonine, 
methylthreonine, hydroxyethyl- cysteine, 

hydroxyethylhomocysteine, nitroglutamine, homoglut amine, 
30 pipecolic acid, tert-leucine, norvaline, 2- 

azaphenylalanine, 3-azaphenylalanine, 4 -azaphenyl -alanine, 
4-f luorophenylalanine, 4-hydroxyproline, 6-N-methyl lysine, 
2-aminoisobutyric acid, isovaline and a-methyl serine. 
Several methods are known in the art for incorporating non- 
35 naturally occurring amino acid residues into proteins. For 
example, an in vitro system can be employed wherein 
nonsense mutations are suppressed using chemically 
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aminoacylated suppressor tRNAs. Methods for synthesizing 
amino acids and aminoacylating tRNA are known in the art . 
Transcription and translation of plasmids containing 
nonsense mutations are carried out in a cell free system 
5 comprising an E. coli S30 extract and commercially 
available enzymes and other reagents. Proteins are 
purified by chromatography. See, for example, Robertson et 
al. f J. Am. Chem. Soc . 113.: 2722, 1991; Ellman et al . , Meth. 
Enzvmol . 202:301, 1991; Chung et al . , Science 259 : 806-09 . 

lo 1993; and Chung et al., Proc. Natl. Acad. Sci . USA 
.90:10145-49, 1993) . In a second method, translation is 
carried out in Xenopus oocytes by microinjection of mutated 
mRNA and chemically aminoacylated suppressor tRNAs 
(Turcatti et al . , J. Biol. Chem. 271 :19991-98. 1996). 

15 Within a third method, E. coli cells are cultured in the 
absence of a natural amino acid that is to be replaced 
(e.g., phenylalanine) and in the presence of the desired 
non-naturally occurring amino acid(s) (e.g., 2- 
azaphenylalanine, 3-azaphenylalanine, 4-azaphenylalanine, 

20 or 4-f luorophenylalanine) . The non-naturally occurring 
amino acid is incorporated into the protein in place of its 
natural counterpart. See, Koide et al . , Biochem. 33:7470- 
76, 1994. Naturally occurring amino acid residues can be 
converted to non-naturally occurring species by in vitro 

25 chemical modification. Chemical modification can be 
combined with site-directed mutagenesis to further expand 
the range of substitutions (Wynn and Richards, Protein Sci. 
2:395-403, 1993) . 

A limited number of non- conservative amino acids, 

30 amino acids that are not encoded by the genetic code, non- 
naturally occurring amino acids, and unnatural amino acids 
may be substituted for zampl polypeptide amino acid 
residues. "Unnatural amino acids" have been modified after 
protein synthesis, and/or have a chemical structure in 

35 their side chain (s) different from that of the standard 
amino acids. Unnatural amino acids can be chemically 
synthesized, or preferably, are commercially available, and 
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include pipecolic acid, thiazolidine carboxylic acid, 
dehydroproline, 3- and 4-methylproline, and 3,3- 
dimethylproline . 

Essential amino acids in the zampl polypeptides 
5 of the present invention can be identified according to 
procedures known in the art, such as site-directed 
mutagenesis or alanine -scanning mutagenesis (Cunningham and 
Wells, Science 244 : 1081-1085, 1989) . In the latter 
technique, single alanine mutations are introduced at every 

10 residue in the molecule, and the resultant mutant molecules 
are tested for biological activity (e.g.,- anti-microbial 
activity) to identify amino acid residues that are critical 
to the activity of the molecule. See also, Hilton et al., 
J. Biol. Chem. 271:4699-4708, 1996. Sites of ligand- 

15 receptor or other biological interaction can also be 
determined by physical analysis of structure, as determined 
by such techniques as nuclear magnetic resonance, 
crystallography, electron diffraction or photoaf f inity 
labeling, in conjunction with mutation of putative contact 

20 site amino acids. See, for example, de Vos et al., Science 
255 :306-312. 1992; Smith et al., J. Mol . Biol. 224:899-904, 
1992; Wlodaver et al., FEBS Lett . 309:59-64, 1992. The 
identities of essential amino acids can also be inferred 
from analysis of homologies with related P-defensins. 

25 Multiple amino acid substitutions can be made and 

tested using known methods of mutagenesis and screening, 
such as those disclosed by Reidhaar-Olson and Sauer 
( Science 241:53-57, 1988) or Bowie and Sauer ( Proc. Natl. 
Acad. Sci. USA 86 :2152-2156. 1989). Briefly, these authors 

30 disclose methods for simultaneously randomizing two or more 
positions in a polypeptide, selecting for functional 
polypeptide, and then sequencing the mutagenized 
polypeptides to determine the spectrum of allowable 
substitutions at each position. Other methods that can be 

35 used include phage display (e.g., Lowman et al., Biochem. 
30:10832-10837, 1991; Ladner et al., U.S. Patent No. 
5,223,409; Huse, WIPO Publication WO 92/06204) and region- 
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directed mutagenesis (Derbyshire et al . , Gene £6:145, 1986; 
Ner et al., DNA 7:127, 1988). 

Variants of the disclosed zampl DNA and 
polypeptide sequences can be generated through DNA 
5 shuffling as disclosed by Stemmer, Nature 370:389-91, 1994, 
Stemmer, Proc. Natl. Acad. Sci . USA 91:10747-51, 1994 and 
WIPO Publication WO 97/20078. Briefly, variant DNAs are 
generated by in vitro homologous recombination by random 
fragmentation of a parent DNA followed by reassembly using 

10 PCR, resulting in randomly introduced point mutations. 
This technique can be modified by using a family of parent 
DNAs, such as allelic variants or DNAs from different 
species, to introduce additional variability into the 
process. Selection or screening for the desired activity, 

15 followed by additional iterations of mutagenesis and assay 
provides for rapid "evolution" of sequences by selecting 
for desirable mutations while simultaneously selecting 
against detrimental changes. 

Mutagenesis methods as disclosed above can be 

20 combined with high- throughput , automated screening methods 
to detect activity of cloned, mutagenized polypeptides in 
host cells. Mutagenized DNA molecules that encode active 
polypeptides (e.g., ant i -microbial activity) can be 
recovered from the host cells and rapidly sequenced using 

25 modern equipment. These methods allow the rapid 

determination of the importance of individual amino acid 
residues in a polypeptide of interest, and can be applied 
to polypeptides of unknown structure. 

Using the methods discussed above, one of 

30 ordinary skill in the art can identify and/or prepare a 
variety of polypeptides that are substantially homologous 
to residues 1 to 65 of SEQ ID NO: 2 or to residues 1 to 67 
of SEQ ID NO: 10 or allelic variants thereof and retain the 
ant i- microbial properties of the wild-type protein. Such 

35 polypeptides may include additional amino acids from 
affinity tags and the like. Such polypeptides may also 
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include additional polypeptide segments as generally 
disclosed above. 

The polypeptides of the present invention, 
including full-length proteins, fragments thereof and 
5 fusion proteins, can be produced in genetically engineered 
host cells according to conventional techniques. However, 
host cells must be selected with some care as a result of 
the ant i -microbial activity of the molecules of the present 
invention. For example, any cell culture-based system must 

10 be evaluated, because zampl polypeptides, fragments, fusion 
proteins, antibodies, agonists or antagonists may kill the 
host cell as a part of an anti-microbial function. Zampl 
polypeptides are of a small enough size to permit 
preparation by PCR or other protein chemistry techniques to 

15 avoid any potential host cell toxicity problems. 
Alternatively, native or engineered precursor proteins, 
prior to post-translational cleavage to yield the mature 
zampl polypeptide, are inactive, thereby limiting host cell 
cytotoxicity prior to lysosomal packaging. See, for 

20 example, Lehrer et al . , Cell 64: 229-30, 1991. Thus, 
precursor proteins to zampl polypeptides may be produced in 
microbial cell culture. 

Suitable host cells are those cell types that can 
be transformed or transfected with exogenous DNA and grown 

25 in culture, and include bacteria, fungal cells, and 
cultured higher eukaryotic cells. Eukaryotic cells, 
particularly cultured cells of multicellular organisms, are 
preferred. Techniques for manipulating cloned DNA 

molecules and introducing exogenous DNA into a variety of 

30 host cells are disclosed by Sambrook et al . , Molecular 
Cloning: A Laboratory Manual . 2nd ed. , Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989, and Ausubel 
et al. (eds.). Current Protocols in Molecular Biology . John 
Wiley and Sons, Inc., NY, 1987. 

35 In general, a DNA sequence encoding a zampl 

polypeptide of the present invention is operably linked to 
other genetic elements required for its expression, 
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generally including a transcription promoter and terminator 
within an expression vector. The vector will also commonly 
contain one or more selectable markers and one or more 
origins of replication, although those skilled in the art 
5 will recognize that within certain systems selectable 
markers may be provided on separate vectors, and 
replication of the exogenous DNA may be provided by 
integration into the host cell genome. Selection of 
promoters, terminators, selectable markers, vectors and 

10 other elements is a matter of routine design within the 
level of ordinary skill in the art. Many such elements are 
described in the literature and are available through 
commercial suppliers. 

To direct a zampl polypeptide into the secretory 

15 pathway of a host cell, a secretory signal sequence (also 
known as a leader sequence, prepro sequence or pre 
sequence) is provided in the expression vector. The 
secretory signal sequence may be that of the zampl 
polypeptide, or may be derived from another secreted 

20 protein (e.g., t-PA) or synthesized de novo. The secretory 
signal sequence is joined to the zampl polypeptide -encoding 
DNA sequence in the correct reading frame. Secretory 
signal sequences are commonly positioned 5' to the DNA 
sequence encoding the polypeptide of interest, although 

25 certain signal sequences may be positioned elsewhere in the 
DNA sequence of interest (see, e.g., Welch et al . , U.S. 
Patent No. 5,037,743; Holland et al . , U.S. Patent No.. 
5, 143,830) . 

Alternatively, the secretory signal sequence 
30 contained in the polypeptides of the present invention is 
used to direct other polypeptides into the secretory 
pathway. The present invention provides for such fusion 
polypeptides. A signal fusion polypeptide can be made 
wherein a secretory signal sequence derived from amino acid 
35 residues 1-18 or 12 of SEQ ID NO: 2 or amino acid residues 
1-20 or 22 of SEQ ID NO: 10, is operably linked to another 
polypeptide using methods known in the art and disclosed 
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herein. The secretory signal sequence contained in the 
fusion polypeptides of the present invention is preferably 
fused amino -terminally to an additional peptide to direct 
the additional peptide into the secretory pathway. Such 
5 constructs have numerous applications known in the art. 
For example, these novel secretory signal sequence fusion 
constructs can direct the secretion of an active component 
of a normally non- secreted protein, such as a receptor. 
Such fusions may be used in vivo or in vitro to direct 

10 peptides through the secretory pathway. 

Cultured mammalian cells are also preferred hosts 
within the present invention. Methods for introducing 
exogenous DNA into mammalian host cells include calcium 
phosphate -mediated transfection (Wigler et al., Cell 

15 14:725, 1978; Corsaro and Pearson, Somatic Cell Genetics 
7:603, 1981: Graham and Van der Eb, Virology 52:456, 1973), 
electroporation (Neumann et al . , EMBO J. 1:841-845, 1982), 
DEAE-dextran mediated transfection (Ausubel et al . , eds., 
Current Protocols in Molecular Biology . John Wiley and 

20 Sons, Inc., NY, 1987), liposome -mediated transfection 
(Hawley-Nelson et al . , Focus 1.5:73, 1993; Ciccarone et al., 
Focus 15:80, 1993) , and viral vectors (A. Miller and G. 
Rosman, BioTechnicrues 7:980-90, 1989; Q. Wang and M. Finer, 
Nature Med . 2:714-16, 1996). The production of recombinant 

25 polypeptides in cultured mammalian cells is disclosed, for 
example, by Levinson et al., U.S. Patent No. 4,713,339; 
Hagen et al., U.S. Patent No. 4,784,950; Palmiter et al., 
U.S. Patent No. 4,579,821; and Ringold, U.S. Patent No. 
4,656,134. Preferred cultured mammalian cells include the 

30 COS-1 (ATCC No. CRL 1650), COS-7 {ATCC No. CRL 1651), BHK 
570 (ATCC No. CRL 10314), 293 (ATCC No. CRL 1573; Graham et 
al., J. Gen. Virol. 36:59-72, 1977) and Chinese hamster 
ovary (e.g. CHO-K1; ATCC No. CCL 61) cell lines. 
Additional suitable cell lines are known in the art and 

35 available from public depositories such as the American 
Type Culture Collection, Rockville, Maryland. In general, 
strong transcription promoters are preferred, such as 
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promoters from SV-40 or cytomegalovirus. See, e.g., U.S. 
Patent No. 4,956,288. Other suitable promoters include 
those from metal lothionein genes (U.S. Patent Nos . 
4,579,821 and 4,601,978) and the adenovirus major late 
5 promoter. 

Drug selection is generally used to select for 
cultured mammalian cells into which foreign DNA has been 
inserted. Such cells are commonly referred to as 

"transfectants". Cells that have been cultured in the 

10 presence of the selective agent and are able to pass the 
gene of interest to their progeny are referred to as 
-stable transfectants." A preferred selectable marker is a 
gene encoding resistance to the antibiotic neomycin. 
Selection is carried out in the presence of a neomycin-type 

15 drug, such as G-418 or the like. Selection systems may 
also be used to increase the expression level of the gene 
of interest, a process referred to as "amplification." 
Amplification is carried out by culturing transfectants in 
the presence of a low level of the selective agent and then 

20 increasing the amount of selective agent to select for 
cells that produce high levels of the products of the 
introduced genes. A preferred amplifiable selectable 
marker is dihydrof olate reductase, which confers resistance 
to methotrexate. Other drug resistance genes (e.g., 

25 hygromycin resistance, multi-drug resistance, puromycin 
acetyltransf erase) can also be used. Alternative markers 
that introduce an altered phenotype, such as green 
fluorescent protein, or cell surface proteins such as CD4, 
CD8, Class I MHC, placental alkaline phosphatase may be 

30 used to sort transfected cells from untransf ected cells by 
such means as FACS sorting or magnetic bead separation 
technology. 

Other higher eukaryotic cells can also be used as 
hosts, including plant cells, insect cells and avian cells. 
35 The use of Agrobacterium rhizogenes as a vector for 
expressing genes in plant cells has been reviewed by Sinkar 
et al., J. Biosci . manaalore ) 11:47-58, 1987. 



WO 99/13080 



PCI7US98/19222 



40 

Transformation of insect cells and production of foreign 
polypeptides therein is disclosed by Guarino et al . , U.S. 
Patent No. 5,162,222 and WIPO publication WO 94/06463. 
Insect cells can be infected with recombinant baculovirus, 
5 commonly derived from Autogrrapha calif ornica nuclear 
polyhedrosis virus (AcNPV) . DNA encoding the zampl 
polypeptide is inserted into the baculoviral genome in 
place of the AcNPV polyhedrin gene coding sequence by one 
of two methods. The first is the traditional method of 

10 homologous DNA recombination between wild-type AcNPV and a 
transfer vector containing the zampl flanked by AcNPV 
sequences. Suitable insect cells, e.g. SF9 cells, are 
infected with wild-type AcNPV and transfected with a 
transfer vector comprising a zampl polynucleotide operably 

15 linked to an AcNPV polyhedrin gene promoter, terminator, 
and flanking sequences. See, King and Possee, The 
Baculovirus Expression System: A Laboratory Guide , London , 
Chapman & Hall; O'Reilly et al., Baculovirus Expression 
Vectors: A Laboratory Manual , New York, Oxford University 

20 Press., 1994; and, Richardson, C. D., Ed., Baculovirus 
Expression Protocols. Methods in Molecular Biology . Totowa, 
NJ, Humana Press, 1995. Natural recombination within an 
insect cell will result in a recombinant baculovirus which 
contains zampl driven by the polyhedrin promoter. 

25 Recombinant viral stocks are made by methods commonly used 
in the art . 

The second method of making recombinant 
baculovirus utilizes a transposon- based system described by 
Luckow (Luckow et al., J Virol . 67:4566-79, 1993). This 

30 system is sold in the Bac-to-Bac kit (Life Technologies, 
Rockville, MD) . This system utilizes a transfer vector, 
pFastBacl™ (Life Technologies) containing a Tn7 transposon 
to move the DNA encoding the zampl polypeptide into a 
baculovirus genome maintained in E. coli as a large plasmid 

35 called a "bacmid." The pFastBacl™ transfer vector utilizes 
the AcNPV polyhedrin promoter to drive the expression of 
the gene of interest, in this case zampl. However, 
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pFastBacl™ can be modified to a considerable degree. The 
polyhedrin promoter can be removed and substituted with the 
baculovirus basic protein promoter (also known as Pcor, 
p6.9 or MP promoter) which is expressed earlier in the 
5 baculovirus infection, and has been shown to be 
advantageous for expressing secreted proteins. See, Hill- 
Perkin and Possee, J. Gen. Virol . 71:971-6, 1990; Bonning. 
et al., J. Gen. Virol . 75:1551-6, 1994; and, Chazenbalk and 
Rapoport, J. Biol. Chem . 270 :1543-9, 1995. In such 

10 transfer vector constructs, a short or long version of the 
basic protein promoter can be used. Moreover, transfer 
vectors can be constructed which replace the native zampl 
secretory signal sequences with secretory signal sequences 
derived from insect proteins. For example, a secretory 

15 signal sequence from Ecdysteroid Glucosyltransf erase (EGT) , 
honey bee Melittin (Invitrogen, Carlsbad, CA) , or 
baculovirus gp67 (PharMingen, San Diego, CA) can be used in 
constructs to replace the native zampl secretory signal 
sequence. In addition, transfer vectors can include an in- 

20 frame fusion with DNA encoding an epitope tag at the C- or 
N- terminus of the expressed zampl polypeptide, for example, 
a Glu-Glu epitope tag (Grussenmeyer et al . , Proc . Nat 1 . 
Acad. Sci . 82:7952-4, 1985). Using a technique known in 
the art, a transfer vector containing zampl is transformed 

25 into E. coli, and screened for bacmids which contain an 
interrupted lacZ gene indicative of recombinant 
baculovirus. The bacmid DNA containing the recombinant 
baculovirus genome is isolated, using common techniques, 
and used to transfect Spodoptera frugiperda cells, e.g. Sf9 

30 cells. Recombinant virus that expresses zampl is 

subsequently produced. Recombinant viral stocks are made 
by methods commonly used the art . 

The recombinant virus is used to infect host 
cells, typically a cell line derived from the fall 

35 armyworm, Spodoptera frugiperda. See, in general, Glick 
and Pasternak, Molecular Biotechnology : Principles and 
Applications of Recombinant DNA . ASM Press, Washington, 
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D.C., 1994. Another suitable cell line is the High FiveO™ 
cell line (Invitrogen) derived from Trichoplusia ni (U.S. 
Patent #5,300,435). Commercially available serum-free 

media are used to grow and maintain the cells. Suitable 
5 media are Sf900 II™ (Life Technologies) or ESF 921™ 
(Expression Systems) for the Sf9 cells; and Ex-cellO405™ 
(JRH Biosciences, Lenexa, KS) or Express FiveO™ (Life 
Technologies) for the T. ni cells. The cells are grown up 
from an inoculation density of approximately 2-5 x 10 3 
10 cells to a density of 1-2 x 10 s cells at which time a 
recombinant viral stock is added at a multiplicity of 
infection (MOI) of 0.1 to 10, more typically near 3. The 
recombinant virus -infected cells typically produce the 
recombinant zampl polypeptide at 12-72 hours post -infection 
15 and secrete it with varying efficiency into the medium. 
The culture is usually harvested 48 hours post-infection. 
Centrifugation is used to separate the cells from the 
medium (supernatant) . The supernatant containing the zampl 
polypeptide is filtered through micropore filters, usually 
20 0.45 fim pore size. Procedures used are generally described 
in available laboratory manuals (King and Possee, ibid . ; 
O'Reilly et al . , ibid . ; Richardson, ibid.). Subsequent 

purification of the zampl polypeptide from the supernatant 
can be achieved using methods described herein. 
25 Fungal cells, including yeast cells, can also be 

used within the present invention. Yeast species of 

particular interest in this regard include Saccharomyces 
cerevisiae, Pichia pastoris, and Pichia methanolica. 

Methods for transforming S. cerevisiae cells with exogenous 
30 DNA and producing recombinant polypeptides therefrom are 

disclosed by, for example, Kawasaki, U.S. Patent No. 

4,599,311; Kawasaki et al., U.S. Patent No. 4,931,373; 

Brake, U.S. Patent No. 4,870,008; Welch et al., U.S. Patent 

No. 5,037,743; and Murray et al . , U.S. Patent No. 
35 4,845,075. Transformed cells are selected by phenotype 

determined by the selectable marker, commonly drug 

resistance or the ability to grow in the absence of a 
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particular nutrient (e.g., leucine). A preferred vector 
system for use in Saccharomyces cerevisiae is the POT1 
vector system disclosed by Kawasaki et al . (U.S. Patent No. 
4,931,373), which allows transformed cells to be selected 
5 by growth in glucose -containing media. Suitable promoters 
and terminators for use in yeast include those from 
glycolytic enzyme genes (see, e.g., Kawasaki, U.S. Patent 
No. 4,599,311; Kingsman et al . , U.S. Patent No. 4,615,974; 
and Bitter, U.S. Patent No. 4,977,092) and alcohol 

10 dehydrogenase genes. See also U.S. Patents Nos. 4,990,446; 
5,063,154; 5,139,93 6 and 4,661,454. Transformation systems 
for other yeasts, including Hansenula polymorphs, 
Schizosaccharomyces pombe, Kluyveromyces lactis, 

Kluyveromyces fragilis, Ustilago maydis, Pichia pastoris, 

15 Pichia methanolica, Pichia guillermondii and Candida 
maltosa are known in the art. See, for example, Gleeson et 
al., J. Gen. Microbiol . 132:3459-65, 1986 and Cregg, U.S. 
Patent No. 4,882,279. Aspergillus cells may be utilized 
according to the methods of McKnight et al., U.S. Patent 

20 No. 4,935,349. Methods for transforming Acremonium 

chrysogenum are disclosed by Sumino et al., U.S. Patent No. 
5,162,228. Methods for transforming Weurospor-a are 

disclosed by Lambowitz, U.S. Patent No. 4,486,533. 

The use of Pichia methanolica as host for the 

25 production of recombinant proteins is disclosed in WIPO 
Publications WO 97/17450, WO 97/17451, WO 98/02536, and WO 
98/02565. DNA molecules for use in transforming P. 
methanolica will commonly be prepared as double- stranded, 
circular plasmids, which are preferably linearized prior to 

30 transformation. For polypeptide production in P. 

methanolica, it is preferred that the promoter and 
terminator in the plasmid be that of a P. methanolica gene, 
such as a P. methanolica alcohol utilization gene {AUG1 or 
AUG2) . Other useful promoters include those of the 

35 dihydroxyacetone synthase (DHAS) , formate dehydrogenase 
(FMD) , and catalase (CAT) genes. To facilitate integration 
of the DNA into the host chromosome, it is preferred to 
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have the entire expression segment of the plasmid flanked 
at both ends by host DNA sequences. A preferred selectable 
marker for use in Pichia methanolica is a P. methanolica 
ADE2 gene, which encodes phosphor ibosyl-5-aminoimidazole 
5 carboxylase (AIRC; EC 4.1.1.21), which allows ade2 host 
cells to grow in the absence of adenine. For large-scale, 
industrial processes where it is desirable to minimize the 
use of methanol, it is preferred to use host cells in which 
both methanol utilization genes (AUG1 and AUG2) are 

10 deleted. For production of secreted proteins, host cells 
deficient in vacuolar protease genes {PEP4 and PRB1) are 
preferred. Electroporation is used to facilitate the 
introduction of a plasmid containing DNA encoding a 
polypeptide of interest into P. methanolica cells. It is 

15 preferred to transform P. methanolica cells by 

electroporation using an exponentially decaying, pulsed 
electric field having a field strength of from 2.5 to 4.5 
kV/cm, preferably about 3.75 kv/cm, and a time constant (t) 
of from 1 to 40 milliseconds, most preferably about 20 

20 milliseconds. 

Prokaryotic host cells, including strains of the 
bacteria Escherichia coli, Bacillus and other genera are 
also useful host cells within the present invention. 
Techniques for transforming these hosts and expressing 

25 foreign DNA sequences cloned therein are well known in the 
art (see, e.g., Sambrook et al . , ibid.). When expressing a 
zampl polypeptide in bacteria such as E. coli, the 
polypeptide may be retained in the cytoplasm, typically as 
insoluble granules, or may be directed to the periplasmic 

30 space by a bacterial secretion sequence. In the former 
case, the cells are lysed, and the granules are recovered 
and denatured using, for example, guanidine isothiocyanate 
or urea. The denatured polypeptide can then be refolded 
and dimerized by diluting the denaturant, such as by 

35 dialysis against a solution of urea and a combination of 
reduced and oxidized glutathione, followed by dialysis 
against a buffered saline solution. In the latter case, 
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the polypeptide can be recovered from the periplasmic space 
in a soluble and functional form by disrupting the cells 
(by, for example, sonication or osmotic shock) to release 
the contents of the periplasmic space and recovering the 
5 protein, thereby obviating the need for denaturation and 
refolding. 

Transformed or transfected host cells are 
cultured according to conventional procedures in a culture 
medium containing nutrients and other components required 

10 for the growth of the chosen host cells. A variety of 
suitable media, including defined media and complex media, 
are known in the art and generally include a carbon source, 
a nitrogen source, essential amino acids, vitamins and 
minerals . Media may also contain such components as growth 

15 factors or serum, as required. The growth medium will 
generally select for cells containing the exogenously added 
DNA by, for example, drug selection or deficiency in an 
essential nutrient which is complemented by the selectable 
marker carried on the expression vector or co-transf ected 

20 into the host cell. P. methanolica cells are cultured in a 
medium comprising adequate sources of carbon, nitrogen and 
trace nutrients at a temperature of about 25°C to 35°C. 
Liquid cultures are provided with sufficient aeration by 
conventional means, such as shaking of small flasks or 

25 sparging of fermentors. A preferred culture medium for P. 
methanolica is YEPD (2% D-glucose, 2% Bacto™ Peptone (Difco 
Laboratories, Detroit, MI), 1% Bacto™ yeast extract (Difco 
Laboratories), 0.004% adenine and 0.006% L- leucine ) . 

Expressed recombinant zampl polypeptides (or 

30 chimeric zampl polypeptides) can be purified using 
fractionation and/or conventional purification methods and 
media. Ammonium sulfate precipitation and acid or 

chaotrope extraction may be used for fractionation of 
samples. Exemplary purification steps may include 

35 hydroxyapatite, size exclusion, FPLC and reverse-phase high 
performance liquid chromatography. Suitable anion exchange 
media include derivatized dextrans, agarose, cellulose, 
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polyacrylamide, specialty silicas, and the like. PEI, 
DEAE, QAE and Q derivatives are preferred, with DEAE Fast- 
Flow Sepharose (Pharmacia, Piscataway, NJ) being 
particularly preferred. Exemplary chromatographic media 
5 include those media derivatized with phenyl, butyl, or 
octyl groups, such as Phenyl -Sepharose FF (Pharmacia) , 
Toyopearl butyl 650 (Toso Haas, Mont gomeryvi lie, PA), 
Octyl -Sepharose (Pharmacia) and the like; or polyacrylic 
resins, such as Amberchrom CG 71 (Toso Haas) and the like. 

10 Suitable solid supports include glass beads, silica-based 
resins, cellulosic resins, agarose beads, cross -linked 
agarose beads, polystyrene beads, cross -linked 
polyacrylamide resins and the like that are insoluble under 
the conditions in which they are to be used. These 

15 supports may be modified with reactive groups that allow 
attachment of proteins by amino groups, carboxyl groups, 
sulfhydryl groups, hydroxyl groups and/or carbohydrate 
moieties. Examples of coupling chemistries include 

cyanogen bromide activation, N-hydroxysuccinimide 

20 activation, epoxide activation, sulfhydryl activation, 
hydrazide activation, and carboxyl and amino derivatives 
for carbodiimide coupling chemistries . These and other 
solid media are well known and widely used in the art, and 
are available from commercial suppliers. Methods for 

25 binding receptor polypeptides to support media are well 
known in the art . Selection of a particular method is a 
matter of routine design and is determined in part by the 
properties of the chosen support. See, for example, 
Affinity Chromatography: Principles & Methods . Pharmacia 

30 LKB Biotechnology, Uppsala, Sweden, 1988. 

The polypeptides of the present invention can be 
isolated by exploitation of their structural properties. 
For example, immobilized metal ion adsorption (IMAC) 
chromatography can be used to purify histidine-rich 

35 proteins or proteins having a His-affinity tag. Briefly, a 
gel is first charged with divalent metal ions to form a 
chelate (Sulkowski, Trends in Biochem. 3:1-7, 1985) . 
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Histidine-rich proteins will be adsorbed to this matrix 
with differing affinities, depending upon the metal ion 
used, and will be eluted by competitive elution, lowering 
the pH, or use of strong chelating agents. Other methods 
5 of purification include purification of glycosylated 
proteins by lectin affinity chromatography and ion exchange 
chromatography ( Methods in Enzvmol . . Vol. 182, "Guide to 
Protein Purification", M. Deutscher, (ed.), Acad. Press, 
San Diego, 1990, pp. 529-39) . Within additional embodiments 

10 of the invention, a fusion of the polypeptide of interest 
and an affinity tag (e.g., maltose -binding protein, an 
immunoglobulin domain) may be constructed to facilitate 
purification. 

Protein refolding (and optionally reoxidation) 

15 procedures may be advantageously used. It is preferred to 
purify the protein to >80% purity, more preferably to >90% 
purity, even more preferably >95%, and particularly 
preferred is a pharmaceutically pure state, that is greater 
than 99.9% pure with respect to contaminating 

20 macromolecules, particularly other proteins and nucleic 
acids, and free of infectious and pyrogenic agents. 
Preferably, a purified protein is substantially free of 
other proteins, particularly other proteins of animal 
origin. 

25 Zampl polypeptides or fragments thereof may also 

be prepared through chemical synthesis. Zampl polypeptides 
may be monomers or multimers; glycosylated or non- 
glycosylated; pegylated or non-pegylated; amidated or non- 
amidated; sulfated or non- sulfated; and may or may not 

30 include an initial methionine amino acid residue. For 
example, zampl polypeptides can be synthesized by exclusive 
solid phase synthesis, partial solid phase methods, 
fragment condensation or classical solution synthesis. The 
polypeptides are preferably prepared by solid phase peptide 

35 synthesis, for example as described by Merrifield, J. Am. 
Chem . Soc . 85:2149, 1963. The synthesis is carried out 
with amino acids that are protected at the alpha- amino 
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terminus. Trif unctional amino acids with labile side- 
chains are also protected with suitable groups to prevent 
undesired chemical reactions from occurring during the 
assembly of the polypeptides. The alpha-amino protecting 
5 group is selectively removed to allow subsequent reaction 
to take place at the amino - terminus . The conditions for 
the removal of the alpha-amino protecting group do not 
remove the side-chain protecting groups. 

The alpha-amino protecting groups are those known 

10 to be useful in the art of stepwise polypeptide synthesis. 
Included are acyl type protecting groups (e.g., formyl, 
trif luoroacetyl, acetyl) , aryl type protecting groups 
(e.g., biotinyl) , aromatic urethane type protecting groups 
[e.g., benzyloxycarbonyl (Cbz) , substituted 

15 benzyloxycarbonyl and 9-f luorenylmethyloxy-carbonyl 

(Fmoc) ] , aliphatic urethane protecting groups [e.g., t- 
butyloxycarbonyl (tBoc) , isopropyloxycarbonyl, 

cyclohexloxycarbonyl] and alkyl type protecting groups 
(e.g., benzyl, triphenylmethyl) . The preferred protecting 

20 groups are tBoc and Fmoc. 

The side-chain protecting groups selected must 
remain intact during coupling and not be removed during the 
deprotection of the amino- terminus protecting group or 
during coupling conditions. The side-chain protecting 

25 groups must also be removable upon the completion of 
synthesis using reaction conditions that will not alter the 
finished polypeptide. In tBoc chemistry, the side-chain 
protecting groups for trif unctional amino acids are mostly 
benzyl based. In Fmoc chemistry, they are mostly tert- 

30 butyl or trityl based. 

In tBoc chemistry, the preferred side-chain 
protecting groups are tosyl for arginine, cyclohexyl for 
aspartic acid, 4-methylbenzyl (and acetamidomethyl ) for 
cysteine, benzyl for glutamic acid, serine and threonine, 

35 benzyloxymethyl (and dinitrophenyl) for histidine, 2-Cl- 
benzyloxycarbonyl for lysine, formyl for tryptophan and 2- 
bromobenzyl for tyrosine. In Fmoc chemistry, the preferred 
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side -chain protecting groups are 2,2,5,7,8- 

pentamethylchroman-6-sulf onyl (Pmc) or 2, 2, 4, 6, 7 -pent a- 
methyldihydrobenzof uran-5-sulfonyl (Pbf) for arginine, 
trityl for asparagine, cysteine, glutamine and histidine, 
5 tert-butyl for aspartic acid, glutamic acid, serine, 
threonine and tyrosine, tBoc for lysine and tryptophan. 

For the synthesis of phosphopeptides, either 
direct or post-assembly incorporation of the phosphate 
group is used. In the direct incorporation strategy, the 
10 phosphate group on serine, threonine or tyrosine may be 
protected by methyl, benzyl, or tert-butyl in Fmoc 
chemistry or by methyl, benzyl or phenyl in tBoc chemistry. 
Direct incorporation of phosphotyrosine without phosphate 
protection can also be used in Fmoc chemistry. In the post- 
15 assembly incorporation strategy, the unprotected hydroxyl 
groups of serine, threonine or tyrosine are derivatized on 
solid phase with di -tert -butyl - , dibenzyl- or dimethyl - 
N, N ' -diisopropyl-phosphoramidite and then oxidized by tert- 
butylhydro-peroxide . 
20 Solid phase synthesis is usually carried out from 

the carboxyl- terminus by coupling the alpha-amino protected 
(side -chain protected) amino acid to a suitable solid 
support. An ester linkage is formed when the attachment is 
made to a chloromethyl, chlorotrityl or hydroxymethyl 
25 resin, and the resulting polypeptide will have a free 
carboxyl group at the C-terminus. Alternatively, when an 
amide resin such as benzhydrylamine or p- 
methylbenzhydrylamine resin (for tBoc chemistry) and Rink 
amide or PAL resin (for Fmoc chemistry) are used, an amide 
30 bond is formed and the resulting polypeptide will have a 
carboxamide group at the C-terminus. These resins, whether 
polystyrene- or polyamide -based or polyethyleneglycol- 
grafted, with or without a handle or linker, with or 
without the first amino acid attached, are commercially 
35 available, and their preparations have been described by 
Stewart et al . , "Solid Phase Peptide Synthesis" (2nd 
Edition), (Pierce Chemical Co., Rockford, IL, 1984) and 
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Bayer and Rapp, Chem . Pept . Prot . 3.: 3, 1986; and Atherton 
et al . , Solid Phase Peptide Synthesis : A Practical 
Approach , IRL Press, Oxford, 1989. 

The C-terminal amino acid, protected at the side 
5 chain if necessary, and at the alpha-amino group, is 
attached to a hydroxylmethyl resin using various activating 
agents including dicyclohexylcarbodiimide (DCC) , N,N'- 
diisopropylcarbodiimide (DIPCDI) and carbonyldi imidazole 
(CDI) . It can be attached to chloromethyl or chlorotrityl 

10 resin directly in its cesium tetramethylammonium salt form 
or in the presence of triethylamine (TEA) or 
diisopropylethylamine (DIEA) . First amino acid attachment 
to an amide resin is the same as amide bond formation 
during coupling reactions. 

15 Following the attachment to the resin support, 

the alpha-amino protecting group is removed using various 
reagents depending on the protecting chemistry (e.g., tBoc, 
Fmoc) . The extent of Fmoc removal can be monitored at 300- 
320 nm or by a conductivity cell. After removal of the 

20 alpha-amino protecting group, the remaining protected amino 
acids are coupled stepwise in the required order to obtain 
the desired sequence. 

Various activating agents can be used for the 
coupling reactions including DCC, DIPCDI , 2-chloro-l, 3- 

25 dimethyl imidium hexaf luorophosphate (CIP) , benzotriazol-1- 
yl-oxy-tris- (dimethylamino) -phosphonium hexafluoro- 
phosphate (BOP) and its pyrrolidine analog (PyBOP) , bromo- 
tris-pyrrolidino-phosphonium hexaf luorophosphate (PyBrOP) , 
O- (benzot r iazol - 1 -yl ) -1,1,3,3- 1 e tramethyl -uronium 

30 hexaf luorophosphate (HBTU) and its tetraf luoroborate analog 
(TBTU) or its pyrrolidine analog (HBPyU) , 0- (7- 
azabenzotriazol-l-yl) -1,1,3, 3 -tetramethyl -uronium 
hexaf luorophosphate (HATU) and its tetraf luoroborate analog 
(TATU) or its pyrrolidine analog (HAPyU) . The most common 

35 catalytic additives used in coupling reactions include 4- 
dimethylaminopyridine (DMAP) , 3-hydroxy-3,4-dihydro-4-oxo- 
1 , 2 , 3-benzotriazine (HODhbt) , N-hydroxybenzotriazole (HOBt) 
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and l-hydroxy-7-azabenzotriazole (HOAt) . Each protected 
amino acid is used in excess (>2.0 equivalents), and the 
couplings are usually carried out in N-methylpyrrolidone 
(NMP) or in DMF, CH2C12 or mixtures thereof. The extent of 
5 completion of the coupling reaction can be monitored at 
each stage, e.g., by the ninhydrin reaction as described by 
Kaiser et al., Anal. Biochem . 34:595, 1970. 

After the entire assembly of the desired peptide, 
the peptide -resin is cleaved with a reagent with proper 

10 scavengers. The Fmoc peptides are usually cleaved and 
deprotected by TFA with scavengers (e.g., H20, 
ethanedithiol, phenol and thioanisole) . The tBoc peptides 
are usually cleaved and deprotected with liquid HF for 1-2 
hours at -5 to 0° C, which cleaves the polypeptide from the 

15 resin and removes most of the side-chain protecting groups. 
Scavengers such as anisole, dimethylsulf ide and p- 
thiocresol are usually used with the liquid HF to prevent 
cations formed during the cleavage from alkylating and 
acylating the amino acid residues present in the 

20 polypeptide. The formyl group of tryptophan and the 
dinitrophenyl group of histidine need to be removed, 
respectively by piperidine and thiophenyl in DMF prior to 
the HF cleavage. The acetamidomethyl group of cysteine can 
be removed by mercury (II) acetate and alternatively by 

25 iodine, thallium (III) trif luoroacetate or silver 
tetraf luoroborate which simultaneously oxidize cysteine to 
cystine. Other strong acids used for tBoc peptide cleavage 
and deprotection include trif luoromethanesulf onic acid 
(TFMSA) and trimethylsilyltrif luoroacetate (TMSOTf ) . 

30 A zampl polypeptide ligand-binding polypeptide 

can also be used for purification of ligand. The 
polypeptide is immobilized on a solid support, such as 
beads of agarose, cross -linked agarose, glass, cellulosic 
resins, silica-based resins, polystyrene, cross-linked 

35 polyacrylamide, or like materials that are stable under the 
conditions of use. Methods for linking polypeptides to 
solid supports are known in the art, and include amine 
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chemistry, cyanogen bromide activation, N- 

hydroxysuccinimide activation, epoxide activation, 
sulfhydryl activation, and hydrazide activation. The 
resulting medium will generally be configured in the form 
5 of a column, and fluids containing ligand are passed 
through the column one or more times to allow ligand to 
bind to the receptor polypeptide. The ligand is then 
eluted using changes in salt concentration, chaotropic 
agents (guanidine HC1) , or pH to disrupt ligand- receptor 
10 binding. 

An assay system that uses a ligand-binding 
receptor (or an antibody, one member of a complement/ anti- 
complement pair) or a binding fragment thereof, and a 
commercially available biosensor instrument (BIAcore™, 

15 Pharmacia Biosensor, Piscataway, NJ) may be advantageously 
employed. Such receptor, antibody, member of a 

complement /ant i -complement pair or fragment is immobilized 
onto the surface of a receptor chip. Use of this 
instrument is disclosed by Karlsson, J. Immunol. Methods 

20 145:229-40, 1991 and Cunningham and Wells, J. Mol . Biol. 
234:554-63, 1993. A receptor, antibody, member or fragment 
is covalently attached, using amine or sulfhydryl 
chemistry, to dextran fibers that are attached to gold film 
within the flow cell. A test sample is passed through the 

25 cell. If a ligand, epitope, or opposite member of the 
complement /ant i- complement pair is present in the sample, 
it will bind to the immobilized receptor, antibody or 
member, respectively, causing a change in the refractive 
index of the medium, which is detected as a change in 

30 surface plasmon resonance of the gold film. This system 
allows the determination of on- and off -rates, from which 
binding affinity can be calculated, and assessment of 
stoichiometry of binding. 

Ligand-binding receptor polypeptides can also be 

35 used within other assay systems known in the art. Such 
systems include Scatchard analysis for determination of 
binding affinity (see Scatchard, Ann. NY Acad. Sci. 51: 
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660-72, 1949) and calorimetric assays ( Cunningham et al., 
Science 253:545-48, 1991; Cunningham et al., Science 
245:821-25, 1991) . 

Zampl polypeptides can also be used to prepare 
5 antibodies that specifically bind to zampl polypeptide 
epitopes, peptides or polypeptides. Methods for preparing 
polyclonal and monoclonal antibodies are well known in the 
art (see, for example, Sambrook et al . , Molecular Cloning: 
A Laboratory Manual , Second Edition , Cold Spring Harbor, 

10 NY, 1989; and Hurrell, J. G. R., Ed., Monoclonal Hybridoma 
Antibodies: Techniques and Appl ications, CRC Press, Inc., 
Boca Raton, FL, 1982) . As would be evident to one of 
ordinary skill in the art, polyclonal antibodies can be 
generated from a variety of warm-blooded animals, such as 

15 horses, cows, goats, sheep, dogs, chickens, rabbits, mice, 
and rats. 

The immunogenicity of a zampl polypeptide may be 
increased through the use of an adjuvant, such as alum 
(aluminum hydroxide) or Freund's complete or incomplete 

20 adjuvant. Polypeptides useful for immunization also 
include fusion polypeptides, such as fusions of zampl 
polypeptide or a portion thereof with an immunoglobulin 
polypeptide or with maltose binding protein. The 
polypeptide immunogen may be a full-length molecule or a 

25 portion thereof. If the polypeptide portion is "hapten- 
like", such portion may be advantageously joined or linked 
to a macromolecular carrier (such as keyhole limpet 
hemocyanin (KLH) , bovine serum albumin (BSA) or tetanus 
toxoid) for immunization. 

30 As used herein, the term "antibodies" includes 

polyclonal antibodies, affinity-purified polyclonal 
antibodies, monoclonal antibodies, and antigen-binding 
fragments, such as F(ab')2 and Fab proteolytic fragments. 
Genetically engineered intact antibodies or fragments, such 

35 as chimeric antibodies, Fv fragments, single chain 
antibodies and the like, as well as synthetic antigen- 
binding peptides and polypeptides, are also included. Non- 
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human antibodies may be humanized by grafting only non- 
human CDRs onto human framework and constant regions, or by 
incorporating the entire non-human variable domains 
(optionally "cloaking" them with a human- like surface by 
5 replacement of exposed residues, wherein the result is a 
"veneered" antibody) . In some instances, humanized 

antibodies may retain non-human residues within the human 
variable region framework domains to enhance proper binding 
characteristics. Through humanizing antibodies, biological 

10 half-life may be increased, and the potential for adverse 
immune reactions upon administration to humans is reduced. 
Alternative techniques for generating or selecting 
antibodies useful herein include in vitro exposure of 
lymphocytes to zampl protein or peptide, and selection of 

15 antibody display libraries in phage or similar vectors (for 
instance, through use of immobilized or labeled zampl 
protein or peptide) . 

Antibodies are defined to be specifically binding 
if: 1) they exhibit a threshold level of binding activity, 

20 and/or 2) they do not significantly cross-react with 
related polypeptide molecules. First, antibodies herein 
specifically bind if they bind to a zampl polypeptide, 
peptide or epitope with a binding affinity (Ka) of 10 6 mol" 1 
or greater, preferably 10 7 mol" 1 or greater, more preferably 

25 10 8 mol" 1 or greater, and most preferably 10 9 mol" 1 or 
greater. The binding affinity of an antibody can be 
readily determined by one of ordinary skill in the art, for 
example, by Scatchard analysis (G. Scatchard, Ann. NY Acad. 
Sci. 51: 660-72, 1949). 

30 Second, antibodies specifically bind if they do 

not significantly cross-react with related polypeptides. 
Antibodies do not significantly cross-react with related 
polypeptide molecules, for example, if they detect human 
zampl polypeptide, but not known related polypeptides using 

35 a standard Western blot analysis (Ausubel et al., ibid.). 
Examples of known related polypeptides are orthologs, that 
is, proteins from the same species that are members of a 
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protein family, such as other known human P-defensins 
(e.g., hBD-1 and hBD-2) , mutant human P-defensins, and non- 
human P-defensins. 

Moreover, antibodies may be "screened against" 
5 known related polypeptides to isolate a population that 
specifically binds to the inventive polypeptides. For 
example, antibodies raised to human zampl polypeptide are 
adsorbed with related polypeptides adhered to an insoluble 
matrix; antibodies specific to human zampl polypeptide will 

10 flow through the matrix under the proper buffer conditions. 
Such screening allows isolation of polyclonal and 
monoclonal antibodies non-crossreactive to closely related 
polypeptides (see, Antibodies: A Laboratory Manual, Harlow 
and Lane (eds.), Cold Spring Harbor Laboratory Press, 1988; 

15 Current Protocols in Immunology, Cooligan et al. (eds.), 
National Institutes of Health, John Wiley and Sons, Inc., 
1995) . Screening and isolation of specific antibodies is 
well known in the art (see, Fundamental Immunology, Paul 
(ed.), Raven Press, 1993; Getzoff et al . , Adv. Immunol . 

20 43:1-98, 1988; Monoclonal Antibodies: Principles and 
Practice, Goding, J.W. (eds.), Academic Press Ltd., 1996; 
Benjamin et al., Ann . Rev . Immunol . 2:67-101, 1984). 

A variety of assays known to those skilled in the 
art can be utilized to detect antibodies which specifically 

25 bind to zampl proteins or peptides. Exemplary assays are 
described in detail in Antibodies: A Laborato ry Manual. 
Harlow and Lane (Eds.), Cold Spring Harbor Laboratory 
Press, 1988. Representative examples of such assays 
include: concurrent Immunoelectrophoresis, 

30 radioimmunoassay, radioimmuno-precipitation, enzyme-linked 
immunosorbent assay (ELISA) , dot blot or Western blot 
assay, inhibition or competition assay, and sandwich assay. 
In addition, antibodies can be screened for binding to 
wild-type versus mutant zampl protein or peptide. 

35 Antibodies to zampl polypeptides may be used for 

tagging cells that express zampl polypeptides; for 
isolating zampl polypeptides by affinity purification; for 
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diagnostic assays for determining circulating levels of 
zampl polypeptides; for detecting or quantitating soluble 
zampl polypeptide as a marker of underlying pathology or 
disease; in analytical methods employing FACS; for 
5 screening expression libraries; for generating anti- 
idiotypic antibodies; and as neutralizing antibodies or as 
antagonists to block ant i -microbial activity in vitro and 
in vivo. 

For pharmaceutical use, the proteins of the 

10 present invention are formulated for topical, inhalant or 
parenteral, particularly intravenous or subcutaneous, 
delivery according to conventional methods. Intravenous 
administration will be by bolus injection or infusion over 
a typical period of one to several hours. In general, 

15 pharmaceutical formulations will include a zampl protein in 
combination with a pharmaceutically acceptable vehicle, 
such as saline, buffered saline, 5% dextrose in water or 
the like . Formulations may further include one or more 
excipients, preservatives, solubilizers, buffering agents, 

2 0 albumin to prevent protein loss on vial surfaces, etc. 
Methods of formulation are well known in the art and are 
disclosed, for example, in Remington: The Science and 
Practice of Pharmacy . Gennaro, ed., Mack Publishing Co., 
Easton, PA, 19th ed. , 1995. Therapeutic doses will 

25 generally determined by the clinician according to accepted 
standards, taking into account the nature and severity of 
the condition to be treated, patient traits, etc. 
Determination of dose is within the level of ordinary skill 
in the art. The proteins may be administered for acute 

30 treatment, over one week or less, often over a period of 
one to three days or may be used in chronic treatment, over 
several months or years. 

The invention is further illustrated by the 
following non-limiting examples. 
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Example 1 
Extension of EST Sequence 

The novel zampl polypeptides of the present 
5 invention were initially identified by querying an EST 
database for homologous sequences to the SAP-1 human 
defensin isolated from human psoriatic skin. A single EST 
sequence was discovered in a bronchial epithelium cDNA 
library and was predicted to be related to the P-defensin 

10 family. A second search based upon the P-defensin 
consensus motif also identified the EST. 

To identify the corresponding cDNA, a clone 
containing the EST was sought, but was not located. 
Oligonucleotides ZC14741 (SEQ ID NO: 5), ZC14740 (SEQ ID 

15 NO: 6) were used in a PGR reaction to isolate the zampl 
polypeptide -encoding sequence from human genomic DNA. 
Reaction conditions were 94°C for 1 minute and 30 seconds, 
followed by 35 cycles of 94°C for 10 seconds, 58°C for 20 
seconds and 72°C for 20 seconds, followed by 72°C for ten 

20 minutes. As a template, 100 ng of human genomic DNA was 
used, and Clontech Advantage PCR mix (Clontech, Palo Alto, 
California) was used as the polymerase mixture. The 
resulting 113 bp fragment was then purified on a' 3.2% 
NuSieve (FMC Bioproducts, Rockland, ME) gel using a QiaexII 

25 Gel Extraction Kit (Qiagen, Inc., Chatsworth, California) 
according to the manufacturer's directions. The purified 
material was used as a template for sequencing. The 
template was sequenced on an ABIPRISM ™ model 377 DNA 
sequencer (Perkin-Elmer Cetus, Norwalk, Ct . ) using the ABI 

30 PRISM™ Dye Terminator Cycle Sequencing Ready Reaction Kit 
(Perkin-Elmer Corp.) according to manufacturer's 
instructions. Oligonucleotides ZC14741 (SEQ ID NO: 5), 
ZC14740 (SEQ ID NO: 6) were used as primers for sequencing 
the clone. Sequencing reactions were carried out in a 

35 Hybaid OmniGene Temperature Cycling System (National Labnet 
Co., Woodbridge, NY). SEQUENCHER™ 3. 1 sequence analysis 
software (Gene Codes Corporation, Ann Arbor, MI) was used 
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for data analysis. The resulting 113 bp sequence is 
disclosed in SEQ ID NO : 1 . Comparison of the originally 
derived EST sequence with the sequence represented in SEQ 
ID NO: 1 showed that there were 2 base pair differences 
5 which resulted in 1 amino acid difference between the 
deduced amino acid sequences . Note that one of the base 
pair differences were from unknown "N" residues in the EST 
sequence to known residues in SEQ ID NO: 1. 

Generally, one or a combination of several 

10 techniques could be used to obtain the full length sequence 
of the zampl polypeptide -encoding polynucleotide. First, 
if one or more additional ESTs are identified that contig 
to the clone sequenced above, clones corresponding to such 
ESTs can be ordered and sequenced as described above and 

15 spliced together with the original sequence to form the 
full length sequence. If a small portion of the full 
length sequence is absent, 5* RACE reactions can be done, 
and the resulting fragments can be sequenced as described 
above and spliced together with the original sequence to 

20 form the full length sequence. Also, one or more cDNA 
libraries can be probed with all or a portion of SEQ ID NO: 
1 to identify a putative full-length clone. Such a full 
length clone can be sequenced as described above. 

25 Example 2 

Tissue Distribution 

Northerns were performed using Human Multiple 
Tissue Blots from Clontech (Palo Alto, CA) . An 

3 0 approximately 113 bp DNA probe, based directly on the 
identified EST, was generated using PCR techniques, 
specifically a 35 cycle reaction with an annealing 
temperature of 58°C using Clontech Advantage KlenTaq 
Polymerase mix (Clontech, Palo Alto, California) . The DNA 

35 probe was radioactively labeled with^P using REDI PRIME® DNA 
labeling system (Amersham, Arlington Heights, Illinois) 
according to the manufacturer's specifications. The probe 
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was purified using a NUCTRAP push column (Stratagene 
Cloning Systems, La Jolla, CA) . EXPRESSHYB (Clontech, Palo 
Alto, CA) solution was used for prehybridization and as a 
hybridizing solution for the Northern blots. Hybridization 
5 took place overnight at 55°C, and the blots were then 
washed in 2X SSC and 0.1% SDS at RT, followed by a wash in 
0.1X SSC and 0.1% SDS at 50°C. No expression was observed. 
It thus appears that normal tissue levels of mRNA of zampl 
polypeptide are below the detection sensitivity of the 
10 Northern blot . Such an observation is consistent with the 
knowledge in the art regarding de fens ins, i.e. . that they 
are constitutively expressed at low levels but are highly 
inducible upon infection. 

15 Example 3 

Chromosomal Mapping of the Zampl Gene 

The zampl gene was mapped to chromosome 8 using 
the commercially available "GeneBridge 4 Radiation Hybrid 

20 Panel" (Research Genetics, Inc., Huntsville, AL) . The 
GeneBridge 4 Radiation Hybrid Panel contains PCRable DNAs 
from each of 93 radiation hybrid clones, plus two control 
DNAs (the HFL donor and the A23 recipient) . A publicly 
available WWW server (http://www-genome.wi.mit.edu/cgi- 

25 bin/contig/rhmapper .pi) allows mapping relative to the 
Whitehead Institute/MIT Center for Genome Research's 
radiation hybrid map of the human genome (the "WICGR" 
radiation hybrid map) which was constructed with the 
GeneBridge 4 Radiation Hybrid Panel. 

30 For the mapping of the zampl gene with the 

"GeneBridge 4 RH Panel", 20 fil reactions were set up in a 
PCRable 96-well microtiter plate (Stratagene, La Jolla, CA) 
and used in a "RoboCycler Gradient 96" thermal cycler 
(Stratagene) . Each of the 95 PCR reactions consisted of 2 

35 /il 10X KlenTaq PCR reaction buffer (CLONTECH Laboratories, 
Inc., Palo Alto, CA) , 1.6 fil dNTPs mix (2.5 mM each, 
PERKIN- ELMER, Foster City, CA) , 1 /il sense primer, ZC 
14,780 (SEQ ID NO: 7), 1 [il antisense primer, ZC 14,776 
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(SEQ ID NO: 8), 2 (il "RediLoad" (Research Genetics, Inc., 
Huntsville, AL) , 0.4 pi 50X Advantage KlenTaq Polymerase 
Mix (Clontech Laboratories, Inc.), 25 ng of DNA from an 
individual hybrid clone or control and x /il ddH20 for a 
5 total volume of 20 /xl. The reactions were overlaid with an 
equal amount of mineral oil and sealed. The PCR cycler 
conditions were as follows: an initial 1 cycle 5 minute 
denaturation at 95°C, 35 cycles of a 1 minute denaturation 
at 95°C, 1 minute annealing at 52°C and 1.5 minute extension 

10 at 72°C, followed by a final 1 cycle extension of 7 minutes 
at 72°C. The reactions were separated by electrophoresis on 
a 2% agarose gel (Life Technologies, Gaithersburg, MD) . 

The results showed that the zampl gene maps 33.5 
cR 3000 from the top of the human chromosome 8 linkage 

15 group on the WICGR radiation hybrid map. Proximal and 
distal framework markers were CHLC.GATA62D10 and WI-3823 
(D8S1511) , respectively. The use of surrounding markers 
positions the zampl gene in the 8p23.3-p23.2 region on the 
integrated LDB chromosome 8 map (The Genetic Location 

20 Database, University of Southhampton, WWW server: 
http: //cedar. genetics, soton.ac.uk/ public_html/) . 



Example 4 

Identification of DNA Encoding Full Length Zampl 
25 Polypeptide and Sequencing Thereof 



The 5 1 end of zampl coding sequence was obtained 
by PCR using GenomeWalker® reagents (Clontech) in 
combination with a zampl polypeptide- specif ic antisense 

30 primer ZC15591 (SEQ ID NO: 12) and then conducting nested 
PCR with ZC15589 (SEQ ID NO: 13) , according to 
manufacturer's instructions, with the exception that 64°C 
was used in the primary reaction instead of the suggested 
67°C. PCR products were run on a 2% agarose gel (Gibco) , 

35 and gel purified using Qiaex II reagents (Qiagen) according 
to manufacturer's instructions. Products were sequenced 
using ZC15589 (SEQ ID NO: 13) as a sequencing primer, 
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revealing the extension of zampl polypeptide -encoding to a 
putative initiation methionine (nucleotides 1-201 or 1-219 
of SEQ ID NO: 9) and about 250 base pairs of 5' untranslated 
sequence . 

5 

Example 5 
Synthesis of Zampl 

10 A 45 amino acid residue zampl peptide (residues 

23 to 67 of SEQ ID NO: 2) was synthesized by solid phase 
peptide synthesis using a model 431A Peptide Synthesizer 
(Applied Biosystems/Perkin Elmer, Foster City, CA) . Fmoc- 
Lysine(Boc) resin (0.52 mmol/g; Anaspec Inc., San Jose, CA) 

15 was used as the initial support resin. 1 mmol Amino acid 
cartridges (Anaspec Inc., San Jose, CA and Applied 
Biosystems/Perkin Elmer, Foster City, CA) were used for 
synthesis . 2- (1-H-benzotriazol-l-yl) -1,1,3,3- 

tetramethyuroniumhexaf luorophosphate (HBTU) , 1-Hydroxy- 

20 benzotriazole (HOBt) , 2 M N, N-Diisopropylethylamine, N- 
Methylpyrrolidone, Dichlorome thane (all from Applied 
Biosystems/Perkin Elmer, Foster City, CA) , along with 
piperidine (Aldrich Chemical Co., St. Louis, MO) and 0.5 M 
acetic anhydride capping solution (Advanced ChemTech, 

25 Louisville, KY) , were used as synthesis reagents. 

The Peptide Companion software (Peptides 
International, Louisville, KY) was used to help predict the 
aggregation potential for the synthesis for zampl. 
Synthesis was performed using both single and double 

30 coupling cycles. Also, acetylation was used where 

difficult couplings were predicted. 

The peptide was cleaved from the solid phase by 
the standard TFA cleavage procedure (according to Peptide 
Cleavage manual, Applied Biosystems/Perkin Elmer) . 

35 Purification of the peptide was done by RP-HPLC using a 
CI 8, 52mm x 250mm, preparative column (Vydac, Hesperia, 
CA) . Fractions from the column were collected and analyzed 
for the correct mass by elect rospray mass spectrometry; the 
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purity was analyzed by analytical RP-HPLC, using a C18, 
4.6mm x 250mm column (Vydac, Hesperia, CA) . The mass 
spectrometry analysis confirmed the desired molecular 
weight of the reduced form of zampl, i.e., 5158. Purified 
5 fractions were frozen and then lyophilized. 

The reduced peptide was dissolved in 6 M 
guanidine HCl (Aldrich Chemical Co.) at an initial 
concentration of 2 mg/ml . This solution was then added 
slowly to 2 . 1 volume equivalents of 1 H guanidine HCl along 

10 with 0.52 volume equivalents of DMSO (Aldrich Chemical 
Co.). The oxidation was monitored with analytical RP-HPLC 
using the same analytical C18 column; the oxidation was 
complete at 48 hours. Salts were removed from the 

reaction mixture using solid phase extraction C18 

15 cartridges (Waters, Milford, MA) . The eluant containing 
the oxidized peptide is concentrated and then purified 
using RP-HPLC semi-prep C18 column (Vydac, Hesperia, CA) . 
Four distinct peaks were determined to correspond to the 
fully oxidized f orm . of zampl by electrospray LCMS. The 

20 peak referred to as peak 2, as it was the second peak to 
elude by RP-HPLC, was found to contain the conserved 
defensin disulfide pattern, by a process of elimination 
using partial digest and peptide mapping of all four peaks. 
This peak was isolated, frozen and lyophilized. 

25 From the foregoing, it will be appreciated that, 

although specific embodiments of the invention have been 
described herein for purposes of illustration, various 
modifications may be made without deviating from the spirit 
and scope of the invention. Accordingly, the invention is 

30 not limited except as by the appended claims. 
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CLAIMS 

WHAT IS CLAIMED IS: 

1. An isolated protein comprising a polypeptide 
that is at least 80% identical to a polypeptide selected from 
the group consisting of: 

a) a polypeptide having the sequence of amino acid 
residue 1 to amino acid residue 65 of SEQ ID NO: 2; 

b) a polypeptide having the sequence of amino acid 
residue 19 to amino acid residue 65 of SEQ ID NO: 2; 

c) a polypeptide having the sequence of amino acid 
residue 21 to amino acid residue 65 of SEQ ID NO:2; 

d) a polypeptide having the sequence of amino acid 
residue 1 to amino acid residue 67 of SEQ ID NO: 10; 

e) a polypeptide having the sequence of amino acid 
residue 21 to amino acid residue 67 of SEQ ID NO: 10; and 

f) a polypeptide having the sequence of amino acid 
residue 23 to amino acid residue 67 of SEQ ID NO:2; 

wherein said polypeptide has cysteine residues 
corresponding to amino acid residues 33, 40, 45, 55, 62 and 63 
of SEQ ID N0s:2 or 10. 

2. An isolated protein of Claim 1, wherein said 
protein comprises a polypeptide having the sequence selected 
from the group consisting of: 

a) a polypeptide having the sequence of amino acid 
residue 1 to amino acid residue 67 of SEQ ID NO: 10; 

b) a polypeptide having the sequence of amino acid 
residue 21 to amino acid residue 67 of SEQ ID NO: 10; and 

c) a polypeptide having the sequence of amino acid 
residue 23 to amino acid residue 6 7 of SEQ ID NO: 10. 



3 . An isolated protein having the sequence of SEQ 
ID NO: 10 from amino acid residue 23 to amino acid residue 67. 
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4 . A pharmaceutical composition comprising a 
protein according to claim 1 in combination with a 
pharmaceutical ly acceptable vehicle. 

5. An antibody that specifically binds to a 
protein of Claim 1. 

6. An anti- idiotypic antibody of an antibody which 
specifically binds to a protein of Claim 1. 

7 . An isolated polynucleotide molecule encoding a 
protein, said polynucleotide molecule consisting of a coding 
strand and a complementary non- coding strand, wherein said 
polynucleotide molecule encodes a polypeptide that is at least 
80% identical to the amino acid sequence to a polypeptide 
selected from the group consisting of: 

a) a polypeptide having the sequence of amino acid 
residue 1 to amino acid residue 65 of SEQ ID NO: 2; 

b) a polypeptide having the sequence of amino acid 
residue 19 to amino acid residue 65 of SEQ ID NO: 2; 

c) a polypeptide having the sequence of amino acid 
residue 21 to amino acid residue 65 of SEQ ID NO: 2; 

d) a polypeptide having the sequence of amino acid 
residue 1 to amino acid residue 67 of SEQ ID NO: 10; 

e) a polypeptide having the sequence of amino acid 
residue 21 to amino acid residue 67 of SEQ ID NO: 10; and 

f) a polypeptide having the sequence of amino acid 
residue 23 to amino acid residue 67 of SEQ ID NO: 2; 

wherein said polypeptide has cysteine residues 
corresponding to amino acid residues 33, 40, 45, 55, 62 and 63 
of SEQ ID NOs:2 or 10. 

8 . An isolated polynucleotide molecule encoding a 
protein having cysteine residues corresponding to amino acid 
residues 33, 40, 45, 55, 62 and 63 of SEQ ID NO: 10, said 
polynucleotide molecule consisting of a coding strand and a 
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complementary non- coding strand, wherein said polynucleotide 
comprises a nucleotide sequence that is at least 80% identical 
to the sequence of a polynucleotide selected from the group 

consisting of: 

a) a polynucleotide as shown in SEQ ID NO: 9 from 

nucleotide 220 to nucleotide 420; 

b) a polynucleotide as shown in SEQ ID NO: 9 from 
nucleotide 280 to nucleotide 420; and 

c) a polynucleotide as shown in SEQ ID NO -.9 from 
nucleotide 286 to nucleotide 420. 

9. An isolated polynucleotide molecule encoding a 
protein having cysteine residues corresponding to amino acid 
residues 33, 40, 45, 55, 62 and 63 of SEQ ID N0:10, said 
polynucleotide molecule consisting of a coding strand and a 
complementary non- coding strand, wherein said polynucleotide 
comprises a nucleotide sequence as shown in SEQ ID NO: 11. 

10. An expression vector comprising the following 
operably linked elements: 

a transcription promoter; 

a DNA segment encoding a protein of claim 1; and 
a transcription terminator. 

11. An expression vector according to claim 10 
wherein said DNA segment further encodes a secretory signal 
sequence operably linked to said protein. 

12. An expression vector according the claim 11, 
wherein said secretory signal sequence is selected from the 
group consisting of : 

a) a polypeptide having the sequence of amino acid 
residue 1 to amino acid residue 18 of SEQ ID NO: 2; 

b) a polypeptide having the sequence of amino acid 
residue 1 to amino acid residue 20 of SEQ ID NO: 2; 

c) a polypeptide having the sequence of amino acid 
residue 1 to amino acid residue 20 of SEQ ID NO: 10; and 
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d) a polypeptide having the sequence of amino acid 
residue 1 to amino acid residue 22 of SEQ ID NO: 10. 

13 . A cultured cell into which has been introduced 
an expression vector comprising the following operably linked 
elements : 

a transcription promoter ; 

a DNA segment encoding a protein according to claim 

1 ; and 

a transcription terminator; 

wherein said cell expresses said protein encoded by 
said DNA segment. 

14. A method of producing a protein comprising: 
culturing a cell into which has been introduced an expression 
vector comprising the following operably linked elements: 

a transcription promoter; 

a DNA segment encoding a protein according to claim 

1 ; and 

a transcription terminator; 

whereby said cell expresses said protein encoded by 
said DNA segment ; and 

recovering said expressed protein. 



15. An oligonucleotide probe or primer comprising 
at least 14 contiguous nucleotides of a polynucleotide of SEQ 
ID N0:11 or a sequence complementary to SEQ ID NO:ll. 
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0 : . : : 

7 TATCTTCTGTTTGCTTTGCTCTTCCTGTTTTTGGTGCCTGTTCCAGGT CATGGAGGA 



12YLLFSFLFIFLMPLPGVFGG 



60 

64 ATCATAAACACATTACAGAAATAANATTGCAGAGTCAGAGGCGGCCGGTGTGCTGTGCTC 



32IGDPVT CLKSGAICHPV 



120 .:".:.:.:.:.: 

124 AGCTGCCTTCCAAAGGAGGAACAGATCGGCAAGTGCTCGACGCGTGGCCGAAAATGCTGC 



49FCPRRYKQIGTCGLPGTKCC 



180 

184 CGAAGA 



69 K K 



Fig. 2 
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SEQUENCE LISTING 

<110> ZymoGenetics. Inc. 

1201 Eastlake Avenue East 
Seattle. Washington 98102 
United States of America 

<120> NOVEL BETA- DEFENS INS 

<130> 97-44PC 

<150> 60/058,335 
<151> 1997-10-09 

<150> 60/064.294 
<151> 1997-11-05 

<160> 13 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 219 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)...(195) 

<400> 1 

atg agg ate cat tat ctt ctg ttt get ttg etc ttc ctg ttt ttg gtg 48 
Met Arg He His Tyr Leu Leu Phe Ala Leu Leu Phe Leu Phe Leu Val 
15 10 15 

cct gtt cca ggt cat gga gga ate ata aac aca tta cag aaa tat tat 96 
Pro Val Pro Gly His Gly Gly He He Asn Thr Leu Gin Lys Tyr Tyr 
20 25 30 

tgc aga gtc aga ggc ggc egg tgt get gtg etc age tgc ctt cca aag 144 
Cys Arg Val Arg Gly Gly Arg Cys Ala Val Leu Ser Cys Leu Pro Lys 
35 40 45 
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gag gaa cag ate ggc aag tgc teg acg cgt ggc cga aaa tgc tgc cga 192 
Glu Glu Gin lie Gly Lys Cys Ser Thr Arg Gly Arg Lys Cys Cys Arg 
50 55 60 

aga aagaaataaa aaccctgaaa catg 219 
Arg 
65 



<210> 2 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Arg lie His Tyr Leu Leu Phe Ala Leu Leu Phe Leu Phe Leu Val 
15 10 15 

Pro Val Pro Gly His Gly Gly He He Asn Thr Leu Gin Lys Tyr Tyr 

20 25 30 

Cys Arg Val Arg Gly Gly Arg Cys Ala Val Leu Ser Cys Leu Pro Lys 

35 40 45 

Glu Glu Gin He Gly Lys Cys Ser Thr Arg Gly Arg Lys Cys Cys Arg 
50 55 60 

Arg 
65 

<210> 3 
<211> 31 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Cysteine motif of the Beta-defensin family 

<221> VARIANT 
<222> (2)... (7) 

<223> Each Xaa is independently any amino acid residue, 
preferably not cysteine 

<221> VARIANT 
<222> (9)... (12) 

<223> Each Xaa is independently any amino acid residue, 
preferably not cysteine 



<221> VARIANT 
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<222> (14)... (20) 

<223> Each Xaa is independently any amino acid residue, 
preferably not cysteine 

<221> VARIANT 
<222> (22) . . . (22) 

<223> Xaa is any amino acid residue, preferably not 
cystei ne 

<221> VARIANT 
<222> (24)... (29) 

<223> Each Xaa is independently any amino acid residue, 
preferably not cysteine 

<400> 3 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 
15 10 15 

Xaa Xaa Xaa Xaa Gly Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys Cys 
20 25 30 

<210> 4 
<211> 213 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Degenerate nucleotide encoding the polypeptide of 
SEQ ID NO: 2 

<221> variation 
<222> (1)...(213) 

<223> Each N is independently any nucleotide 
<400> 4 

athcaytayy tnytnttygc nytnytntty ytnttyytng tnccngtncc nggncayggn 60 

ggnathatha ayacnytnca raartrrnnn tgymgngtnm gnggnggnmg ntgygcngtn 120 

ytnwsntgyy tnccnaarga rgarcarath ggnaartgyw snacnmgngg nmgnaartgy 180 

tgymgnmgna araartrraa rccntrraay atg 213 

<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Oligonucleotide ZC14741 
<400> 5 

gagcacttgc cgatctgttc 20 

<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC14740 
<400> 6 

ccaggtcatg gaggaatcat 20 

<210> 7 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC14780 
<400> 7 

ggaggaatca taaacaca 18 

<210> 8 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC14776 
<400> 8 

gccgatctgt tcctcctt 18 

<210> 9 
<211> 438 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
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<222> (220). ..(420) 
<400> 9 

acaaatccat agggagctct gccttaccat tgggttccta attaactgag tgagtgggtg 60 

tgttctgcat ggtgagaggc attggaatga tgcatcagaa aacatgtcat aatgtcatca 120 

ctgtaatatg acaagaattg cagctgtggc tggaaccttt ataaagtgac caagcacacc 180 

ttttcatcca gtctcagcgt ggggtgaagc ctagcagct atg agg ate cat tat 234 

Met Arg lie His Tyr 
1 5 



ctt 


ctg ttt 


get ttg 


etc ttc 


ctg 


ttt 


tug 


gtg ccx 


gtt cca 


ggt cat 


cod 


Leu 


Leu Phe 


Ala Leu 


Leu Phe 


LcU 


rne 


LcU 


Wal Dm 

Va 1 rTO 


Va 1 rTO 


biy HIS 








10 








ID 






on 




gga 


gga ate 


ata aac 


aca tta 


cag 


3 3 3 

dad 


tat 
Id L 


tat tnr 

Lat tgc 


3fi3 nf r 

aga gtc 


aga ggc 


ion 
ooU 


Gly 


Gly He 


lie Asn 


Thr Leu 


Gin 


uyb 


iyr 


iyi ^ji 


Am Val 
mi y va i 


g u i y 








25 






oU 






JD 






ggc 


egg tgt 


get gtg 


etc age 




rtt 


f~f*a 


aan nan 

day gag 


naa ran 
ydd Cay 


d ix yg c 


O/O 


Gly 


Arg Cys 


Ala Val 


Leu Ser 




1 PI I 


Prr> 


1 v<; Glu 

L. Jr o U 1 U 


Glu Gin 


Tip Glv 






40 














all 






aag 


tgc teg 


acg cgt 


ggc cga 


aaa 


tgc 


tgc 


cga aga 


aag aaa 




420 


Lys 


Cys Ser 


Thr Arg 


Gly Arg 


Lys 


Cys 


Cys 


Arg Arg 


Lys Lys 








55 




60 








65 








taaaaaccct gaaacatg 
















438 




<210> 


10 




















<211> 


67 




















<212> 


PRT 




















<213> 


Homo sapiens 


















<400> 


10 


















Met 


Arg He 


His Tyr Leu Leu 


Phe 


Ala 


Leu 


Leu Phe 


Leu Phe 


Leu Val 




1 




5 








10 






15 




Pro 


Val Pro 


Gly His Gly Gly 


He 


He 


Asn 


Thr Leu 


Gin Lys 


Tyr Tyr 








20 






25 






30 






Cys 


Arg Val 


Arg Gly Gly Arg 


Cys 


Ala 


Val 


Leu Ser 


Cys Leu 


Pro Lys 






35 






40 








45 






Glu 


Glu Gin 


He Gly Lys Cys 


Ser 


Thr 


Arg 


Gly Arg 


Lys Cys 


Cys Arg 






50 




55 








60 








Arg 


Lys Lys 




















65 
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<210> 11 
<211> 219 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Degenerate nucleotide sequence encoding the 
polypeptide of SEQ ID NO: 10 

<221> variation 
<222> CI). . .(219) 

<223> Each N is independently any nucleotide 
<400> 11 

atgmgnathc aytayytnyt nttygcnytn ytnttyytnt tyytngtncc ngtnccnggn 60 
cayggnggna thathaayac nytncaraar trrnnntgym gngtnmgngg nggnmgntgy 120 
gcngtnytnw sntgyytncc naargargar carathggna artgywsnac nmgnggnmgn 180 
aartgytgym gnmgnaaraa rtrraarccn trraayatg 219 

<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 2C15591 
<400> 12 

tgccgatctg ttcctccttt g 21 

<210> 13 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC15589 



<400> 13 
gaacaggcac caaaaacagg aagag 



25 
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